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SUMMARY 


(1)  TECHNICAL  OBJECTIVES  AND  ISSUES  UNDER  INVESTIGATION; 

Working  with  realistic  modal  descriptions  of  vibratory  mechanical  systems,  the  technical 
objective  was  to  design  active  damping  controllers.  The  systems  of  interest  are  multiple  input, 
multiple  output  (MIMO),  heavily  damped  structures  with  complex  modes.  The  control  objective 
is  to  suppress  the  effects  of  impulsive  disturbances  within  a  short  period  of  time  as  well  as  to 
provide  a  specified  reduction  of  vibration  due  to  ever-present  stochastic  disturbances. 

Systems  with  complex  modes  cannot  be  decoupled  exactly.  One  issue  that  was  resolved 
was  the  impracticality  of  using  a  controller  for  an  approximately  decoupled  system.  Simulations 
showed  that  the  desired  amount  of  damping  could  not  be  achieved  with  this  controller  due  to 
stability  problems  caused  by  the  approximations. 

The  technical  issue  regarding  state  estimation  was  to  develop  an  optimal  estimator  for  the 
system  driven  by  non-gaussian  stochastic  signals. 


(2)  TECHNICAL  APPROACHES: 

The  team  effort  can  be  viewed  as  three  interactive  parts;  one  group  investigating  the 
robust  control  issues;  one,  the  non-gaussian  estimation  problem  and  a  third  group  working  with 
a  mechanical  plate  experiment,  generating  data  and  identification  algorithms.  The  experimental 
model  was  constructed  in  the  M.E.  Department  under  DARPA  funding.  Our  group  interacted 
well  with  the  M.E.  group,  sharing  ideas  and  methods  where  relevant. 

The  robust  control  investigators  used  modem  analysis  and  synthesis  tools  to  incorporate 
the  required  performance  directly  into  the  design  procedure.  The  approach  to  handle  the  non- 
gaussian  estimation  problem  was  the  use  of  a  modified  gaussian  sum. 


(3)  SUMMARY  OF  ACCOMPLISHMENTS; 

General  Discussion 

Our  effort  began  with  the  design  of  a  baseline  controller.  The  system  of  interest  was  a 
MIMO,  heavily  damped  structure  with  complex  modes,  and  the  control  objective  was  to  suppress 
impulsive  disturbances  in  a  given  time  interval  and  to  provide  a  specified  degree  of  suppression 
to  an  ever-present  stochastic  disturbance  for  which  we  have  frequency  domain  information.  In 
order  to  limit  the  complexity  of  our  controller,  a  decoupled,  independent  modal  control  approach 
was  explored.  Although  a  system  with  complex  modes  cannot  be  decoupled  exactly,  an  attempt 


Page  1 


was  made  to  design  a  controller  for  an  approximately  decoupled  system.  Simulations  showed, 
however,  that  the  desired  increase  in  damping  could  not  be  achieved  with  this  controller  due  to 
stability  problems  brought  about  by  the  approximations.  The  decoupling  approach  was 
abandoned  and  replaced  by  a  standard  estimator-feedback  control  structure.  To  address  both 
types  of  disturbances  simultaneously,  an  LQG/LTR  methodology  was  used,  after  suitable 
modification  to  account  for  the  presence  of  the  feed-through  term  in  our  system  that  was  due  to 
the  use  of  accelerometers  as  sensors.  This  provided  an  acceptable  baseline  solution  to  our 
problem  to  which  we  could  compare  our  ideas  for  improvement.  One  area  in  which  the  baseline 
design  was  deficient  was  robust  stability  to  unstructured  uncertainty.  The  solution  to  this 
problem  was  to  incorporate  robust  stability  and  performance  requirements  directly  into  the 
design  procedure. 


The  Problem  of  Structured  Uncertainties 

Robust  control  methods  were  studied  for  application  to  vibrational  systems  whose 
uncertainties  have  known  structure.  Structured  uncertainties  arise,  for  example,  in  plants  with 
uncertain  parameters  in  the  state  space  model.  Therefore,  vibrational  systems  with  uncertain 
modal  frequencies  and  damping  ratios  are  well  suited  for  structured  uncertainty  designs.  The 
purpose  of  such  methods  is  to  take  advantage  of  the  structure  of  plant  uncertainties  in  order  to 
reduce  the  conservativeness  of  the  compensator  design.  Structured  uncertainty  designs 
accomplish  this  by  guaranteeing  stability  robustness  only  to  uncertainties  of  the  specified 
structure,  thereby  sacrificing  as  little  performance  as  possible. 

The  chosen  design  method  is  a  refinement  of  a  modified  Linear  Quadratic  Gaussian 
method  by  Jong- Yin  Lin  and  others.  Lin  models  parameter  uncertainties  by  isolating  constants 
of  unknown  magnitude  in  ficticious  feedback  paths  and  treating  the  return  signals  as  gaussian 
white  noise  entering  the  plant  with  direction  determined  by  the  nature  of  the  uncertainty.  The 
proposed  method  exploits  the  fact  that  these  return  signals  are  neither  gaussian,  nor  white  noise. 
Rather,  they  depend  on  the  magnitudes  of  the  states,  controls,  and  measurements  of  the  system. 
A  more  accurate  modeling  of  the  parameter  uncertainties  will  normally  yield  compensator 
designs  with  better  performance  for  the  same  level  of  stability  robustness.  This  improvement 
is  achieved  without  an  increase  in  compensator  order  or  a  loss  of  optimality.  Also,  the  optimal 
reduced-order  compensator  equations  of  Hyland  and  Bernstein  adapt  naturally  to  the  proposed 
design  method  and  allow  an  additional  tradeoff  between  control  system  performance  and 
compensator  order  reduction. 

Results  in  Non-Gaussian  Estimation 

Optimal  state  estimation  has  been  accomplished  for  a  linear  system  driven  by  an  unknown 
non-gaussian  input  with  additive  white  noise,  and  observed  by  measurements  containing 
feedthrough  of  the  same  nonGaussian  input  and  corrupted  by  additional  white  Gaussian  noise. 
One  approach  considered  that  can  cope  with  the  non-gaussian  nature  of  the  input  signal  is  the 
"gaussian  sum  technique",  where  the  probability  density  function  of  the  non-gaussian  input  is 
approximated  by  a  weighted  sum  of  gaussian  density  functions.  An  adaptive  filter  based  on  this 
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technique,  however,  has  the  limitation  that  the  number  of  terms  in  the  gaussian  sum  grows  at 
each  iteration  of  the  filter.  A  modified  approach  has  been  developed  in  this  research  which 
results  in  an  ad2q>tive  filter  with  a  fixed  number  of  terms  at  each  iteration. 

As  a  result  of  this  modified  gaussian  sum  research,  a  necessary  condition  for  effective 
estimation  has  been  established.  It  has  been  found  that  the  value  of  the  DC  gain  of  the  linear 
system  provides  a  simple  test  to  determine  whether  or  not  the  modified  estimation  scheme  will 
prove  effective.  Severn  alternate  estimation  methods  have  been  established  when  this  condition 
is  not  met.  Also,  a  suboptimal  method  of  tuning  the  parameters  of  the  adaptive  filter  structure 
to  provide  enhanced  performance  has  been  investigated. 

This  moditied  estimation  technique  proved  effective  in  the  problem  of  robust  modal  control 
of  large,  MIMO,  heavily  damped  structures.  The  adaptive  filter  was  found  to  provide  high 
quality  estimates  of  modsd  position  and  velocity  required  by  the  robust  control  system. 

Progress  on  System  Identification 

Effective  and  efficient  system  identification  techniques  for  discrete-time,  linear,  MIMO, 
heavily  damped  modal  systems  from  input/output  sequences  have  been  developed  and  simulated. 
This  facilitated  a  better  understanding  of  the  possible  errors  in  the  estimated  model  and  a  more 
accurate  compensator  and  estimator  design. 

Three  different  time  domain  identification  techniques  have  been  developed  in  this 
research.  The  first  technique  is  known  as  Pseudo-Linear  Identification  (PLID)  for  simultaneous 
state  estimation  and  parameter  identification.  The  method  involves  forming  an  augmented 
system  that  is  nonlinear,  due  to  the  multiplication  of  the  state  by  parameters.  The  augmented 
state  vector  is  estimated  by  a  Kalman  filter,  which  has  a  state  matrix  that  depends  on  the  input 
and  output  data. 

The  second  algorithm  developed  determines  the  state-space  model  in  a  pseudo- 
controllable/observable  canonical  form.  The  advantage  of  this  technique  is  its  ability  to 
determine  structural  information,  which  means  determination  of  controllability/observability 
indices.  A  MIMO  system  is  obtained  in  pseudo  controllable/observable  form,  based  on  a  set  of 
admissible  pseudo-  controllable/observable  indices. 

The  last  identification  algorithm  developed  in  this  research  is  a  computational 
simplification  of  the  Eigensystem  Realization  Algorithm.  The  ERA  algorithm  involves  singular 
value  decomposition  of  the  Hankel  matrix  formed  by  the  Markov  parameters.  This  Hankel 
matrix  grows  as  the  system  order  and  number  of  inputs  and  outputs  increase.  With  the 
computational  simplification,  a  minimal  state  space  representation  is  obtained  by  a  simple  and 
appropriate  selection  of  the  columns  or  rows  from  the  Hankel  matrix.  The  identified  state  space 
m^el  is  obtained  in  a  canonical  form. 

These  system  identification  techniques  have  been  verified  on  a  simply  supported 
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rectangular  plate  experimental  set-up.  The  results  achieved  were  very  encouraging.  These 
techniques  could  be  easily  applied  to  a  more  complex  modal  structure. 


(4)  SIGNIFICANCE  OF  ACCOMPLISHMENTS 

This  project  has  provided  results  in  the  direction  of  a  complete  control  algorithm  for 
structural  vibration  damping.  When  the  various  research  results  came  together,  the  control 
algorithm  was  found  to  be  capable  of  reducing  critical  modes  of  vibration  in  response  to 
persistent  or  impulsive  non-gaussian  disturbances. 


(5)  PUBLICATIONS  AND  PRESENTATIONS: 


Copies  of  the  following  papers  are  included  with  this  report. 

Journal  Papers: 

"A  Computationally  Efficient  Technique  for  State  Estimation  of  Nonlinear  Systems", 
Automatica.  February  1992,  (J.S.Dhingra,  R.L.Moose,  H.F.VanLandingliam  and 
T.A.Lauzon). 

"Nonlinear  Effects  of  a  Modal  Domain  Optical  Fiber  Sensor  in  a  Vibration  Suppression 
Control  Loop  for  a  Flexible  Structure",  ASMS  Journal  of  Vibration  and  Acoustics,  to 
appear.  (D.  K.  Lindner,  G.  A.  Zvonar,  W.  T.  Baumann  and  P.  L.  Delos). 

"A  Modified  Gaussian-Sum  Approach  to  Estimation  of  Non-Gaussian  Stochastic  Signals", 
IEEE  Transactions  on  Aerospace  and  Electronic  Systems,  to  appear  (July  1992).  (M.  J. 
Caputi  and  R.  L.  Moose). 


International  Conference: 

"Measurement  and  Control  of  Flexible  Structures  Using  Distributed  Sensors",  29th  IEEE 
Conference  on  Decision  and  Control.  December  1990.  ( D.K.Lindner,  K.M.  Reichard, 
W.T.Baumann  and  M.  F.  Barsky  ). 

"Application  of  Fuzzy  Logic  Control  to  Active  Vibration  Damping",  Eighth  VPI  &  SU 
Symposium  on  Dynamics  and  Control  of  Large  Structures.  May  1991.  (A.  Tsoukkas  and 
H.  F.  VanLandingham). 


Page  4 


National  and  Regional  Conferences: 


"Deterministic  Identification  of  Linear  Multivariable  Systems",  22nd  Southeastern 
Symposium  on  System  Theory.  March  1990.  (  B.Gorti,  S.Bingulac  and 

H.F.VanLandingham). 

"A  Computational  Procedure  for  Stabilizing  Lightly  Damped  Systems",  21st  Modeling 
and  Simulation  Conference.  May  1990.  (  B.Gorti,  S.Bingulac  and  H.F.VanLandingham). 

"Estimation  of  Non-Gaussian  Signals  Using  a  Modified  Gaussian  Sum  Ad^tive  Filter", 
28th  Allerton  Conference.  October  1990.  (  M.J.Caputi,  R.L.Moose  and  W.T.Baumann). 

"Computational  Simplification  in  Eigensystem  Realization  Algorithm",  28th  Allerton 
Conference.  October  1990.  (  B.Gorti,  S.Bingulac  and  H.F.VanLandingham). 


(6)  PARTICIPANT  LIST: 

William  T.  Baumann,  co-investigator 
Richard  L.  Moose,  co-investigator 
Hugh  F.  VanLandingham,  co-investigator 
Stanoje  Bingulac,  consultant 

Mauro  J.  Caputi,  Ph.D.  student,  degree  awarded  in  May  1991 
Stephen  H.  Jones,  Ph.D.  student,  degree  awarded  in  July  1991 
Bhaskar  M.  Gorti,  M.S.  student,  degree  awarded  in  February  1991 


Page  5 


Table  of  Contents 


Page 

0.  Introduction .  1 

0.1  Purpose  and  Importance  of  This  Work .  1 

0.2  Research  Objectives .  2 

1.  Motivation  for  This  Study .  4 

1.1  Criteria  for  the  Merit  of  Candidate  Designs .  4 

l.lA.  Nominal  Performance .  4 

l.lB.  Performance/Stability  Robustness  Tradeoff .  8 

l.lC.  Compensator  Order .  16 

l.lD.  Steady  State,  Transient  Disturbance  Rejection .  17 

1.2  Contributions .  17 

2.  Frequency-Shaped  Noise .  19 

2.1  Problem  Statement .  19 

2.2  Noise  Shaping  Filter  Design . 21 

2.3  Controller  Design  Summary .  23 

3.  Multiplicative  White  Noise . 25 

3.1  Problem  Statement .  25 

3.2  Preliminaries  of  Stochastic  Differential  Equation  Theory .  28 

3.3  Conversion  to  Deterministic  Minimization  Problem .  30 

3.4  Derivation  of  the  Necessary  Conditions .  35 

3.5  Modification  for  Stratonovich  Noise  Model .  41 

3.6  Controller  Design  Summary .  42 

4.  Optimal  Reduced-Order  Control .  44 


V 


5.  Methods  <uid  Algorithms  for  Controller  Design .  49 

5.1  Selection  of  Cost  Functional  Weighting  Matrices  and 

Covari^lnce  Matrices .  49 

5.2  Disturbance  Cancellation  and  Inverse  Optimal  Control .  53 

5.3  Discrete-Time  Controller  Design .  58 

5.3A.  Conversion  of  Cost  and  Covariance  Matrices  to  Discrete-Time  58 
5.3B.  Computational  Delay .  61 

5.4.  Iterative  Relaxation  Algorithm  for  Solving  Coupled 

Riccati/Lyapimov  Equations .  62 

5.5  Homotopy  Algorithm  for  Solving  Optimal  Projection  Equations .  67 

6.  Evaluation  of  Designs .  75 

6.1  Problem  Description .  75 

6.2  Comparison  of  Tradeoff  for  Different  Methods .  82 

6.2A.  Uncertainty  in  Natural  Frequency .  83 

6.2B.  Uncertainty  in  Eigenvector .  86 

6.2C.  Results  for  Different  Factorizations .  88 

6.3  ltd  vs.  Stratonovich  Noise .  93 

6.4  Reduced-Order  Controller .  94 

7.  Application  to  Simply  Supported  Plate .  101 

7.1  Stability  Robustness  Problem  and  Its  Solution .  101 

7.2  Effectiveness  of  Reduced-Order  Controllers .  104 

8.  Conclusions .  Ill 

8.1  Robustness .  Ill 

8.2  Controller  Order .  112 

8.3  Parameter  Robust  Reduced-Order  Design .  113 

8.4  Directions  for  Further  Study .  113 

References .  114 

Vita .  118 


List  of  Figures 


Page 

Figure  1.1:  Nominal  Performance  for  LQG  vs.  Dto^-Optinial  Control .  7 

Figure  1.2:  PRLQG  Error  Model .  10 

Figure  1.3:  LQG/PRE  Error  Model .  10 

Figure  1.4:  Time-Domain  Auxiliary  Input  Models .  14 

Figure  1.5:  Frequency-Domain  Auxiliary  Input  Models .  15 

Figure  2.1:  Model  Used  to  Approximate  Power  Spectral  Densities  of 

Auxiliary  Inputs .  20 

Figure  2.2:  Plant  Augmented  with  Noise  Shaping  Filters .  20 

Figure  6.1:  Frequency  Responses  of  Augmented  Plant  Components  for 

4-Mode  Model .  81 

Figure  6.2:  Steady  State  Performance  and  Stability  Tradeoff  for  Uncertainty 

in  Natural  Frequency .  85 

Figure  6.3:  Transient  Performance  and  Stability  Tradeoff  for  Uncertainty 

in  Natural  Frequency .  87 

Figure  6.4:  Steady  State  Performance  and  Stability  Tradeoff  for  Uncertainty 

in  Eigenvector .  89 

Figure  6.5:  Transient  Performance  and  Stability  Tradeoff  for  Uncertainty 

in  Eigenvector .  90 

Figure  6.6:  Comparative  Performance  for  Several  Different  LQG/PRE 

Factorizations .  92 

Figure  6.7:  Frequency  Responses  for  Full-  and  Reduced-Order  Compensators.  96 

Figure  6.8:  Disturbance  Rejection  for  Full-  and  Reduced-Order  Compensators  97 

Figure  6.9:  Frequency  Response  of  Full-  and  Reduced-Order  Compensators....  99 

Figure  6.10:  Time  Response  for  Full-  and  Reduced-Order  Compensators .  100 

Figure  7.1:  Performance/Stability  Robustness  Tradeoff  for  Uncertainty  in  Wj.  103 

Figure  7.2:  Robust  Control  —  Mode  1  Response  to  60  Hz  Disturbance .  105 

Figure  7.3:  Robust  Control  —  Mode  2  Response  to  60  Hz  Disturbance .  106 


108 


Figure  7.4:  Response  of  Full-Order  Robust  Controllers  With  and  Without 

Time  Delay . 

Figure  7.5:  Robust  Reduced-Order  Control  —  Mode  1  Response  to  60  Hz 

Disturbaince .  109 

Figure  7.6:  Robust  Reduced-Order  Control  —  Mode  2  Response  to  60  Hz 

Disturbance .  110 


List  of  Tables 


Page 

Table  6.1:  Natural  Frequencies  juid  Damping  Ratios  for  the  First  Nine  Modes  77 
Table  6.2:  Controller  Design  Parameters  Corresponding  to  Eight  Different 


Factorizations .  91 

Table  6.3:  LQ  Cost  and  Disturbance  Rejection  vs.  Controller  Order .  95 

Table  7.1:  Sample  Rates  for  Full*  and  Reduced-Order  Controllers .  107 


tx 


0.  Introduction 


0.1  Purpose  and  Importance  of  This  Work 

The  purpose  of  this  research  is  to  explore  two  new  and  related  methods  of 
designing  compensators  for  the  active  control  of  uncertain  systems.  Both  methods  are 
designed  to  improve  the  robustness  of  a  control  system’s  stability  and  performance  with 
respect  to  uncertainty  in  selected  parameters  of  the  state-space  model  of  the  plant. 
They  accomplish  this  by  modeling  the  effect  of  uncertainties  as  accurately  as  possible 
by  fictitious  noise  sources  while  simultaneously  reducing  sensitivity  by  means  of 
additional  penalties  in  the  cost  functional.  The  first  method  constructs  the  noise  model 
in  the  frequency  domain,  whereas  the  second  formulates  noise  in  the  time  domain.  In 
addition,  both  incorporate  optimal  order  reduction  directly  into  the  design.  These 
methods  are  expected  to  have  general  application  to  plants  with  imcertain  state-spcice 
parameters;  however,  they  will  be  applied  in  this  work  to  the  problems  involved  in  the 
suppression  of  disturbances  in  flexible  structures,  which  suffer  particularly  firom  the 
difficulties  of  large  plant  uncertainty  and  large  compensator  order. 

Modeling  and  identification  of  flexible  structures,  themselves,  constitute  a 
difficult  problem  and  an  active  area  of  research  (e.g.,  Balas  and  Doyle  1990).  The 
prevalence  of  this  type  of  research  is  an  indicator  that  linejir,  finite-dimensional,  time- 
invariant  (LFDTI)  models  of  flexible  structures  are  examples  of  highly  imcertain  plants, 
md  it  points  to  the  need  for  robust  controllers  for  such  systems. 

Also,  LFDTI  models  of  flexible  structures  tend  to  be  of  very  high  order.  Firstly, 
since  flexible  structures  are  infinite-dimensional,  it  is  desirable  to  consider  as  many 
vibrational  modes  as  possible  in  order  to  reduce  the  effect  of  unmodeled  dynamics  on 
the  control  system.  Secondly,  it  is  frequently  necessary  to  augment  the  plant  with 
additional  dynamics  in  order  to  model  disturbances  more  accurately  or  to  meet  more 
precise  performance  specifications.  Disturbmce  modeling  (e.g.,  Kwakernaak  and  Sivan 
1972,  sec.  1.11.4)  is  desirable  when  the  disturbances  are  correlated  and  something  is 
known  about  their  frequency  content.  This  allows  the  exogenous  signals  to  be  modeled 
by  white  noise  processes  (in  the  case  of  LQG-based  designs),  provided  the  disturbance 
d}rnamics  are  appended  to  the  plant  at  the  disturbance  inputs.  Likewise,  frequency¬ 
shaped  cost  functionals  (Gupta  1980)  are  used  when  the  state  and  control  weighting 
matrices  are  functions  of  frequency.  This  is  accomplished  in  the  LQG  framework  by 
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appending  dynzimics  to  the  plant  at  the  controlled- variable  outputs.  It  has  been  shown 
(Sievers  and  von  Flotow  1989)  that  these  two  extensions  to  LQG  theory  are  duals  and, 
in  the  single-input  single-output  (SISO)  case,  equivalents  of  one  another.  Also,  the 
same  two  methods  are  used  for  analogous  purposes  in  MoQ-based  design  (e.g.,  Doyle 
1984). 

In  either  case,  an  LQG  or  !Koo-optimal  compensator,  being  of  the  same  order  as 
the  augmented  plant,  must  also  be  of  high  order.  As  a  result,  full-order  controllers  tend 
to  be  of  very  high  order  and  therefore  place  a  great  computational  burden  on  the  real¬ 
time  processor  hardware. 

0.2  Research  Objectives 

The  proposed  parameter  robust  reduced-order  control  methods  are  described  in 
Chapter  1  and  developed  in  Chapters  2  through  5.  Then,  the  resulting  designs  are 
applied  to  a  continuous-time  FDLTI  model  of  a  simply  supported  rectangular  plate  (in 
Chapter  6),  as  well  as  to  the  actual  hardware  (in  Chapter  7).  The  control  system  has 
one  control  input  actuator,  one  disturbance  input  actuator,  and  twelve  accelerometer 
sensors.  The  state-space  model  of  the  plate  has  been  derived  from  modal  frequency  and 
modeshape  data  obtained  by  an  identification  procedure.  This  model  is  augmented  by 
the  dynamics  associated  with  a  contol  signal  smoothing  filter  and  a  noise  shaping  filter 
designed  to  reflect  the  characteristics  of  a  colored  noise  disturbance.  Anadysis  has 
shown  the  stability  of  the  closed-loop  system  to  be  most  susceptible  to  errors  in  the 
natural  frequencies  and  control  input  modeshape  vector,  and  experimentation  has 
confirmed  this.  Therefore,  emphatsis  is  planed  on  makking  the  system  less  sensitive  to 
errors  in  the  parameters  of  the  state-space  model  corresponding  to  these  quantities. 

There  ane  three  main  objectives  to  this  research.  The  first  objective  is  to 
evaluate  the  efficiency  of  the  performance/stability  robustness  tradeoff  for  the  two 
proposed  design  methods.  This  is  done  by  comparing  compensators  designed  by  the 
frequency-domain  and  time-domain  noise  modeling  techniques  (of  Chapters  2  and  3, 
respectively)  with  those  designed  by  two  existing  methods  discussed  in  Chapter  1: 
LQG/PRE  and  multiplicative  white  noise  modeling  (without  the  auxiliau^y  output 
modeling  feature  used  in  Chapter  3).  First,  a  parameter  range  is  specified  for  which  the 
system  must  be  stable.  This  parameter  range  is  laurger  than  the  stability  range  obtained 
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by  the  standard  LQG  design.  Second,  full-order  compensators  are  designed  by  all  four 
tf^chniques  to  just  meet  this  stability  robustness  specification,  using  the  scime 
performance  criterion  zmd  assumed  exogenous  noise  covariances  used  by  the  LQG 
'design.  Finally,  it  is  determined  how  much  performance  was  sacrificed  to  attain  this 

I 

I  level  of  robustness  by  meaisuring  the  performance  of  the  designs  (i.e.,  the  quadratic  cost) 
over  the  stability  range  of  interest.  This  method  is  applied  to  a  simple  1-mode  model  of 
the  plate  in  Chapter  6  to  illustrate  the  design  considerations  involved.  Uncertainties  in 
both  the  control  input  modeshape  vector  (i.e.,  the  controller  actuator  location)  amd  the 
naturad  frequency  axe  considered.  In  Chapter  7,  a  4-mode  model  of  the  plate  is  analyzed 
to  determine  the  cause  of  the  poor  stability  properties  of  the  actuad  hairdwaure.  Then  the 
most  sxiitable  robust  controller  is  selected  amd  implemented  on  the  plate. 

The  second  objective  is  to  investigate  the  extent  to  which  the  order  of  a  standard 
LQG  compensator  can  be  reduced  for  FDLTI  models  of  the  rectamgulair  plate.  When 
one  adds  the  states  of  a  second-order  smoothing  filter  and  a  noise  shaping  filter  to  the 
two  states  per  vibrational  mode  of  the  plate,  the  high  order  of  the  plamt  becomes 
apparent,  ais  does  the  need  for  a  reduced-order  compensator.  The  procedure  is  as 
follows.  First,  a  full-order  LQG  compensator  is  designed  to  provide  satisfactory  nominal 
performance.  Then  reduced-order  compensators  are  designed  using  the  optimal 
projection  equations,  and  it  is  determined  how  far  the  order  can  be  reduced  without 
significant  degradation  in  performance.  This  is  carried  out  in  Chapter  6,  and  the  results 
of  implementation  in  hardware  a^e  described  in  Chapter  7. 

The  third  and  final  objective  is  to  combine  the  best  modified  LQG/PRE  design 
method  with  the  optimal  projection  equations  to  determine  the  overall  merit  of  the  new 
robust,  minimal  order  design.  This  work  is  carried  out  in  Chapter  7. 
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1.  Motivation  for  This  Study 


The  ideas  developed  in  this  work  were  motivated  by  the  shortcomings  of  existing 
Dtj-  and  Dtoo-based  techniques  for  rejecting  disturbances  in  uncertain  systems.  In  this 
chapter,  standards  are  defined  for  the  practical  merit  of  a  controller,  and  two 
approaches  are  proposed  to  better  meet  those  standards  than  do  existing  methods. 
Then  the  specific  contributions  of  this  work  are  outlined. 

1.1  Criteria  for  the  Merit  of  Candidate  Designs 

The  standards  that  will  be  used  to  judge  and  compjire  the  effectiveness  of  various 
compensator  designs  are:  1)  nominal  disturbance  rejection  performance;  2)  “efficiency” 
of  tradeoff  between  performance  and  stability  robustness;  3)  compensator  order;  and 
4)  ability  to  reject  both  steady  state  and  transient  disturbances.  What  follows  is  a 
discussion  of  these  criteria  and  the  means  by  which  they  will  be  met. 

A.  Nominal  Performance 

The  performance  of  a  compensator  will  be  measured  by  its  ability  to  minimize 
the  effect  of  unwanted  exogenous  signals  on  the  prescribed  controUed-variable  vector, 
whose  elements  consist  of  weighted  linear  combinations  of  system  states  and  controls. 
Nominal  performance  refers  to  the  performance  of  a  control  system  with  all  plant 
parameters  equal  to  their  assumed  values.  Two  competing  classes  of  performance 
optimality  have  been  considered,  LQG  and  Mqo.  These  two  definitions  of  optimality 
differ  in  the  way  they  model  the  exogenous  signals  and  in  the  way  they  meMure  the 
magnitude  of  the  resulting  disturbance  in  the  controlled  variables. 

In  the  traditional  LQG  formulation  of  the  disturbance  rejection  problem,  optimal 
performance  is  defined  to  be  that  which  minimizes  the  cost  functional 

^  I  (1-1) 

in  the  presence  of  unit  intensity,  uncprrelated  white  process  and  sensor  noise  (v  aind  n, 
respectively).  For  the  purpose  of  easier  comparison  with  K^o-based  methods,  an 
equivalent  formulation  will  be  made  in  the  frequency  domain.  Define  the  noise  vector, 
w,  and  the  controUed-variable  vector,  z,  as  follows: 
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(1.2) 


w 


and  denote  the  closed-loop  transfer  function  matrix  from  lu  to  ^  by  H{s).  Then  the  Stj- 
optimal  compensator  is  that  which  minimizes  the  Dt2  iiorm  of  H{s)  (i.e.,  ||ff(5)|[i).  The , 
fact  that  the  LQG  and  definitions  of  optimal  performance  are  equivalent  are  an 
immediate  consequence  of  the  following  theorem: 

Theorem.  J  =  |l^(s)|l,^ 

Proof.  Using  the  subscript  “7^,  define  time-truncated  functions  as  follows: 

/ 

=  x(t),  —'^<t<T\ 

—  0,  otherwise 

Then  / 

=feT  e{  //’■(<«')<*<} 


1  °r  *  m  •  M-  1  ■  X 

=  \H[s)i\ 


du 


In  the  proof  of  the  theorem,  use  was  made  of  the  definition  of  the  %2  operator 
norm  (e.g.,  Dailey  1990),  Parseval’s  Theorem,  and  the  fact  that  the  covariance  of  the 
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assumed  disturbance  vector,  u>,  is  the  identity  matrix. 

In  Dtoo  controller  design,  the  same  closed-loop  transfer  function  is  considered,  and 
optimality  of  performance  is  achieved  when  its  Utoo  nonn  (i.e.,  ||H(s)||(,o)  is  minimized. 
However,  the  disturbances,  u>,  and  the  controlled  variables,  z,  are  viewed  differently. 
The  %ac>  operator  norm  is  the  induced  Lj  norm,  meaning 

|^W|»=  s“P  1^(0  It  (1-3) 

ll«(t)lb  =  » 

Therefore,  w  and  z  are  considered  to  be  L2  (i.e.,  square  integrable)  signals.  This 
expression  for  the  Moj,  norm  reveals  a  possible  drawback  to  Dto^'based  control  methods 
for  disturbance  rejection.  Since  the  Moo  controller  is  designed  for  the  worst  case  umty- 
norm  disturbance,  no  consideration  is  given  to  the  relative  intensities  of  multiple 
noise  sources.  The  worst  case  disturbance  may  be  one  whose  entire  power  is  delivered 
to  a  single  process  noise  or  sensor  noise  port.  LQG-based  methods,  on  the  other  hand, 
model  the  noise  intensity  at  each  port  separately. 

Another  expression  for  the  norm  is  given  by  its  definition, 

where  a  denotes  the  maximum  singular  value.  This  expression  shows  that  the  Dfegp- 
optimal  compensator  is  the  one  which  minimizes  the  worst  case  disturbance  rejection 
over  all  firequencies.  In  contrast,  the  LQG  compensator  provides  better  disturbance 
rejection  averaged  over  all  frequencies.  This  principle  is  illustrated  in  the  singular  value 
plot  of  a  closed-loop  vibrational  system  in  Figure  1.1.  Using  the  same  state  and  control 
weightings,  Q  and  iZ,  an  LQG  and  an  M*,  compensator  were  designed  to  reject 
disturbances  in  an  eight-mode  model  of  a  flexible  structure.  The  two  cxirve.  indicate 
the  mavimiiTTi  singular  value  of  the  transfer  function  matrix,  H{s),  for  the  two  different 
compensator  designs.  The  M^p  compensator  provided  slightly  better  worst  case 
performance,  but  at  the  cost  of  significantly  worse  performance  averaged  over  the  entire 
frequency  range. 

Because  of  these  two  potential  problems  associated  with  disturbance  rejection  in 
Mpp-based  designs  (i.e.,  less  precise  noise  modeling  and  less  practical  definition  of 
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Max.  Singular  Value 


optimality)  preferential  consideration  was  given  to  improving  LQG-based  design 
methods  in  the  expectation  that  they  would,  by  convention2il  measures,  provide 
significantly  better  nominal  performance. 

B.  Performance/Stability  Robustness  Tradeoff 

For  uncertain  systems,  K^-based  methods,  such  as  //-synthesis  (Doyle  1985), 
have  the  advantage  that  plant  uncertainties  with  singular  value  bormds  can  be 
incorporated  directly  into  the  design,  without  the  need  for  fictitious  noise  sources  to 
model  the  uncertainties.  However,  these  methods  also  have  serious  limitations,  as 
illustrated  in  their  recent  application  to  a  flight  control  problem  (Doyle,  Lenz,  and 
Packard  1987).  Firstly,  controllers  do  not  provide  optimal  performance  and 
stability  robustness  simultaneously.  For  example,  an  controller  designed  for 
performance  may  yield  very  poor  stability  margins.  Secondly,  the  /i-s3mthesis  technique 
corrects  this  problem  by  attempting  an  optimal  tradeoff  between  performance  and 
stability  robustness,  but  the  resulting  controllers  tend  to  be  of  very  high  order. 
Therefore,  ad  hoc  controller  and/or  model  reduction  schemes  are  required  to  reduce  the 
controller  order  to  a  practical  size. 

An  LQG-based  method  can  avoid  these  problems,  provided  that  its  noise  models 
eulequately  represent  the  uncertainties  without  greatly  increasing  the  order  of  the 
compensator.  For  this  reason,  the  main  thrust  of  this  work  is  to  improve  on  an  existing 
LQG-based  robust  control  method  by  more  acciirately  modeling  the  plant  \mcertainties 
by  means  of  more  suitable  fictitious  noise  sources.  The  purpose  of  doing  so  is  to  give 
the  resulting  compensators  the  same  degree  of  stability  robustness  with  a  smaller 
sacrifice  in  performance.  The  degree  to  which  this  occurs  will  be  referred  to  as  the 
“efficiency”  of  the  performance/stability  robustness  tradeoff. 

Tahk  and  Speyer  (1987)  introduced  a  modified  LQG  synthesis  procedure,  later 
called  parameter  robust  LQG  (PRLQG),  in  which  perturbations  in  the  elements  of  the 
A-matrix  are  modeled  by  a  fictitious  internal  feedback  loop.  Given  a  nominal  system 
matrix,  A,  and  the  perturbed  matrix.  A,  define  the  perturbation  matrix  by 
AA  =  A  —  A.  Then  f2u:tor  it  as  follows:  AA  =  MLN,  where  M  and  N  are  of  full  rank 
(to  minimize  their  dimensions)  2aid  L  is  without  loss  of  generality  a  diagonad  matrix, 
whose  diagonal  elements  reflect  the  (unknown)  magnitude  of  the  perturbations.  An 
accurate  model  of  the  perturbed  system  results  after  the  addition  of  this  internal 
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feedback  loop,  as  shown  in  Figure  1.2.  Now,  given  the  nominal  system, 

x  =  Ax  +  Bu  +  GiW  (1.5) 

y  =  Cx  +  G2W 

the  perturbed  system  is  described  by 

x  =  Ax  +  Bu  +  GiW  +  Mwa  (1.6) 

y  =  Cx  +  G^w 
2^  =  Nx 
Wa  =  LZa 

where  amd  are  auxiliary  input  and  output  variables,  respectively.  Intuitively,  one 
may  suspect  that  the  robustness  of  the  system  to  the  prescribed  parameter  variations 
wotdd  be  enhanced  if  were  modeled  as  white  noise,  thereby  adding  fiMM^  (for  some 
scalar  /i)  to  the  process  noise  covariance.  Actually,  Tahk  amd  Speyer  showed  that  under 
certain  minimxim  phaise  and  similarity  conditions,  the  robustness  of  am  LQ  regulator  is 
recovered  asymptotically  as  (i-too.  Note  that  this  robustness  recovery  technique  is 
identical  to  LQG/LTR  for  uncertainties  at  the  input  (Doyle  amd  Stein  1981),  provided 
M  =  B.  However,  the  PRLQG  method  hais  the  aulvamtage  that  M  contains  pautiad 
information  ais  to  the  structure  of  the  perturbation,  AA.  In  fact,  Tadik  and  Speyer 
showed  that  LQG/LTR  fails  to  aisymptotically  desensitize  the  estimator  (when  PRLQG 
succeeds)  for  certain  structures  of  AA,  since  B  does  not  necessarily  paiss  a  crucial 
similarity  condition  with  respect  to  the  matrix  AA,  namely  that  spam{M }  C  span{B} 
for  some  AfLiNT-factorization  of  A  A.  Vibration  control  examples  have  demonstrated 
that  in  such  cases  LQG/LTR  may  provide  relatively  poor  robustness  to  parameter 
uncertaiinties  (Tahk  amd  Speyer  1989).  PRLQG  also  includes  the  dual  of  the  above 
procedure  —  amalogous  to  LQG/LTR  for  mcertainties  at  the  output.  In  the  dual 
procedure,  z^  is  treated  as  am  auxiliary  controlled  variable,  amd  the  term  pN^N  (for 
some  scalar  p)  is  added  to  the  state  weighting  matrix  in  the  regulator  design.  This 
procedure  adso  makes  sense  intuitively,  as  it  seems  reaisonable  to  penailize  the  auxiliary 
output  variable  in  the  cost  functional  in  order  to  minimize  the  magnitude  of  the 
auxiliary  inputs  that  result.  Combining  the  two  procedures  ais3rmptotically  yields 
“absolute  robustness”  (i.e.,  stability  robustness  to  parameter  uncertainties  of  arbitrary 
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Figure  1.3:  LQG/PRE  Error  Model 
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magnitude)  as  n,  p  -*  oo. 

Lin  (1989)  studied  the  combined  effect  of  both  procedures  in  more  detail  and  <dso 
extended  the  method  to  include  parameter  variations  in  the  B-  and  C-matrices  in  a  way 
that  is  less  cumbersome  than  that  suggested  by  Tahk  and  Speyer.  This  extended 
method  was  termed  LQG/PRE  (standing  for  Linear  Quadratic  Gaussian  design  with 
Parameter  Robustness  Enhancement).  The  resulting  model  of  the  plant,  with 
perturbations  in  A,  B,  and  C  is  shown  in  Figure  1.3.  The  system  equations  are: 

X  =  Ax  +  Bu  +  GiW  +  + (1.7) 

y  =  Cx  +  G2W  + 

=  N^x,  Zfc  =  Nf,u,  =  N^x 

where 

AA  4  AB  ^  AC  ^  (1.8) 

Given  the  nominal  regulator  weighting  matrices,  Q  and  i2,  and  the  nominal  Kalman 
filter  noise  covariances,  Qf  and  Rf,  this  leads  to  the  following  modified  LQG  design 
matrices  (denoted  with  carats): 

(1.9) 

R  =  RAp^NjN^ 

Qf  =  Qj-\-  PaM„Mj  +  /XfcMfcM/ 

Rf  =  Rfh 

The  main  weakness  of  the  LQG/PRE  scheme  seems  to  be  the  fact  that  it  models 
the  auxiliciry  input  variables  as  white  noise.  In  order  to  reflect  reality,  these  fictitious 
input  signals  should  generadly  be  modeled  quite  differently.  For  ex2unple,  suppose  there 
is  a  single  uncertain,  but  constant,  parameter  in  A  (i.e.,  w„  is  scalar,  and  A  A  is 
constant).  Then  a  true  model  of  the  perturbed  system  would  require  to  be 
proportional  to  one  of  the  states,  say  i,-  (through  w„  =  L^N^x).  Since  the  sign  and 
magnitude  of  the  perturbation  is  unknown  (i.e.,  unknown),  cannot  be  modeled 
faithfully;  however,  intuitively,  it  seems  its  dyneonics  should  more  closely  resemble  the 
dynamic  behavior  of  ij  rather  than  that  of  a  constant-intensity  white  noise  variable. 

Considerable  research  has  already  been  conducted  that  will  be  applied  to 
constructing  a  more  meaningful  model  of  the  auxiliary  inputs.  This  research  was  done 
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outside  the  LQG/PRE-framework,  and  did  not  include  the  related  problem  of  penalizing 
auxiliary  outputs.  Using  stocheistic  differential  equation  theory,  Wonham  (1967,  1968) 
developed  modified  Riccati  equations  for  both  LQ  regulator  and  Kalm<in  filter  design  for 
systems  with  “state-dependent  noise”.  State-dependent  noise  arises  in  systems  with 
state  equations  of  the  form 


X  =  Ax  +  Bu  +  G{x)w  (1-10) 

where  G{x)  is  some  function  of  the  state,  and  tu  is  a  (vector)  white  noise  process.  In  the 
above  exaimple,  we  would  model  as  state-dependent  noise,  which  would  make  it 
white  noise  with  time-varying  intensity  proportional  to  i,.  Equivalently,  one  could 
conceptualize  AA  as  a  matrix  whose  elements  are  white  noise  processes. 

Hyland  (Hyland  and  Madiwale  1981,  Hyland  1982),  motivated  by  the  maximum 
entropy  principle  and  the  concept  of  A  A  as  a  matrix  of  white  noise  processes,  modeled 
imcertainties  in  the  A-matrix  by  adding  state-dependent  noise  to  the  plant  model.  In 
doing  so,  he  developed  the  equations  for  the  full  LQG  compensator  for  systems  with 
state-dependent  noise.  For  such  systems,  the  separation  principle  no  longer  holds.  The 
regulator  and  filter  design  require  the  solution  not  of  two  uncoupled  Riccati  equations, 
but  of  four  coupled  equations  —  two  modified  Riccati  equations  and  two  modified 
Lyapunov  equations.  Later,  Bernstein  and  Hyland  (1988a)  extended  this  result  to  apply 
to  systems  with  state-,  control-,  and  measurement-dependent  noise,  effectively  zdlowing 
parameter  perturbations  in  all  three  system  matrices  (AA,  AH,  and  AC,  respectively) 
to  be  modeled  as  white  noise  processes.  The  design  process  is  computationally  more 
difficult  thcin  that  of  simple  LQG  in  that  it  requires  the  iterative  solution  of  four 
coupled  matrix  equations,  but  notably  the  optimal  compensator  is  of  the  same  order  as 
the  plant.  That  is,  adding  state-,  control-,  and  measurement-dependent  noise 
(henceforth  referred  to  collectively  as  multiplicative  white  noise)  to  a  system  involves  no 
increase  in  compensator  order. 

As  stated  at  the  beginning  of  this  section,  this  research  involves  modeling 
fictitious  noise  sources  more  accurately  in  an  LQG-type  problem.  Specifically,  the 
LQG/PRE  approach  will  be  adopted,  with  modifications  that  are  intended  to  improve 
the  efficiency  of  the  performance/stability  robustness  tradeoff  by  remodeling  the 
auxiliary  input  signals  such  that  they  more  closely  reflect  reality.  Two  different 
fictitious  noise  models  will  be  investigated,  one  motivated  by  consideration  of  the  time- 
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domain  knowledge  of  the  auxiliary  input  signals,  and  the  other  by  frequency- domain 
knowledge. 

The  time-domain  method  consists  of  using  multiplicative  white  noise  in  place  of 
simple  white  noise  to  model  the  auxiliary  input  signals.  Recalling  the  simple  example 
mentioned  above,  the  motivation  for  this  approach  is  illustrated  in  Figure  1.4.  These 
plots  were  derived  from  a  two-mode  simulation  of  the  rectangular  plate  apparatus 
described  in  section  1.3.  A  true  constant  parameter  uncertainty  in  the  A-matrix  wotild 
be  compensated  for  optimally  if  were  modeled  perfectly  (i.e.,  =  kx^,  where  the 

constant  k  is  determined  by  the  sign  and  magnitude  of  the  parameter  variation),  as 
shown  in  Figure  1.4a.  However,  since  k  is  unknown,  this  model  is  impossible  to 
implement.  The  LQG/PRE  method  uses  constant-intensity  white  noise  to  model 
(say,  Wa  =  w),  as  in  Figure  1.46.  But  the  multiplicative  white  noise  approach  will  use 
state-dependent  noise  to  take  advantage  of  the  fluctuations  in  x,  (i.e.,  =  ijU;),  as 

shown  in  Figure  1.4c.  Presumably,  the  extent  to  which  this  noise  model  improves  the 
LQG/PRE  design  will  depend  on  the  degree  to  which  the  amplitude  of  i,-  varies  over 
time.  In  particular,  the  multiplicative  noise  model  should  provide  a  greater 
improvement  for  transient  noise  rejection  than  for  steady-state  (i.e.,  constant-intensity) 
noise  rejection.  This  modification  of  LQG/PRE  requires  solution  of  the  coupled  Riccati 
and  Lyapxmov  equations,  but  does  not  result  in  an  increase  in  compensator  order. 

The  frequency-domain  method  involves  am  attempt  to  match  the  true  auxiliary 
inputs  in  the  frequency  domain.  Time-domain  matching  suffers  from  the  fact  that  the 
sign  of  the  parameter  perturbations  is  unknown.  As  a  result,  a  very  broad  band  signal 
is  used  in  an  attempt  to  cover  both  possibilities  at  every  time  instant.  However,  our 
knowledge  of  the  frequency  content  of  the  auxiliary  inputs  is  limited  only  by  our 
knowledge  of  the  disturbance’s  frequency  content  and  of  the  closed-loop  system 
dynamics.  So,  given  an  assumed  disturbance  model,  the  true  auxilieiry  inputs  can  be 
modeled  fairly  accurately  in  the  frequency  domaiin  by  closing  the  loop  with  am  LQG 
compensator  which  gives  the  desired  nominal  performance  amd  meaisuring  the  frequency 
response  of  W/,,  and  (see  Figure  1.3).  Therefore,  the  frequency- domain  behavior 
of  the  auxiliary  inputs  will  be  approximated  by  passing  fictitious  white  noise  signads 
through  suitable  frequency-shaping  filters.  The  effect  is  demonstrated  in  the  frequency 
spectrum  plots  of  Figure  1.5,  generated  from  the  saune  data  used  in  Figure  1.4.  The 
frequency-shaped  noise  model,  produced  by  meains  of  a  second-order  filter,  provides  a 
much  better  frequency-domain  replica  of  the  true  signad  {w^  =  kx^)  than  does  a  white 
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Figure  1.5:  Frequency- Domain  Auxiliary  Input  Models 
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noise  model  {w^  =  w).  As  with  time-domain  matching,  the  correct  magnitude  of  the 
auxiliary  inputs  cannot  be  duplicated,  but  the  intensity  of  the  noise  signals  used  to 
generate  the  noise  models  may  be  adjusted  as  design  peu'ameters.  This  modification  of 
LQG/PRE  does  not  require  solution  of  the  coupled  Riccati  and  Lyapunov  equations; 
however,  it  effectively  increases  the  size  of  a  full-order  compensator,  since  it  involves 
augmentation  of  the  plant  model  with  additionad  dynamics. 

C.  Compensator  Order 

Many  methods  are  available  to  reduce  compensator  order,  but  they  fall  into 
three  basic  categories  —  model  reduction,  controller  reduction,  and  direct  design 
(Anderson  and  Liu  1989).  Model  and  controller  reduction  may  be  accomplished  by  such 
methods  as  balanced  truncation  (Moore  1981)  or  Hankel  norm  approximation  (Glover 
1984),  but  the  optimality  of  the  compensator  (for  a  given  controller  order)  —  and 
sometimes  even  stability  —  is  preserved  only  by  designing  a  reduced-order  controller 
directly  from  a  full-order  plant  model.  The  equations  for  the  direct  design  of  LQG 
reduced-order  controllers  were  developed  by  Kwakemaak  and  Sivan  (1972,  sec.  5.7), 
among  others.  However,  the  solution  of  these  equations  requires  a  gradient  search  on 
the  many  free  parameters  of  the  controller  state-space  model.  A  far  more  practical 
method  was  developed  by  Hyland  and  Bernstein  (1984),  involving  solution  of  the 
optimal  projection  equations  —  a  coupled  system  of  two  modified  Riccati  and  two 
modified  Lyapunov  equations,  similar  to  those  mentioned  above  in  the  discussion  on 
stochastic  control  with  multiplicative  white  noise.  The  Lyapunov-type  equations  are 
analogous  to  the  Lyapunov  equations  required  in  solving  the  bedanced  truncation  of  the 
plant  or  controller,  but  they  are  coupled  with  the  modified  Riccati  equations  by  mecins 
of  zm  optim<il  projection  matrix  when  the  controller  is  less  than  full-order.  This 
coupling  demonstrates  the  fact  that  balancing  and  controller  design  cannot  be 
performed  separately  to  obtain  an  optimal  reduced-order  controller. 

The  suggested  modifications  to  LQG/PRE  easily  fit  into  the  framework  of 
optimal  reduced-order  compensators,  with  minor  modifications  to  the  optimal  projection 
equations.  This  is  an  important  feature,  since  the  degree  to  which  controller  reduction 
is  possible  may  very  well  determine  whether  modem  control  applied  to  flexible 
structures  can  be  implemented. 
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D.  Steady  State,  Transieiit  Disturbance  Rejection 

When  sensors  2u:e  available  to  measure  incoming  disturbances,  feedforward 
control  becomes  an  alternative  to  feedback  control.  The  main  advantage  of  feedforward 
control  is  that  the  compensator  does  not  need  to,  in  effect,  model  the  plant  dynamics. 
Therefore,  feedforward  control  app>ears  to  be  an  attractive  alternative  when  small 
compensator  order  is  important.  However,  experience  has  shown  that  the  advantage  in 
compensator  order  can  be  significantly  reduced  if  the  feedback  controller  is  designed  by 
optimal  projection.  Also,  feedback  controllers  can  provide  added  damping  to  the  plant 
amd  therefore  provide  the  ability  to  reject  transient  disturbances.  For  these  reasons, 
only  feedback  control  techniques  are  studied  here. 


1.2  Contributions 

This  research  has  a  number  of  contributions.  Firstly,  the  internal  feedback  loop 
(IFL)  modeling  principle  of  Tahk  and  Speyer  is  generalized  and  fully  exploited  to 
maximize  its  potential.  Secondly,  the  concept  of  multiplicative  white  noise  is  given 
broader  application  by  means  of  a  new  interpretation,  and  a  complete  derivation  is 
given  of  the  controller  design  equations  fi:om  first  principles.  Thirdly,  implementation 
concerns  are  thoroughly  discussed,  and  explicit  algorithms  are  developed  that  have 
application  beyond  this  study.  Lastly,  the  validity  of  the  controller  design  techniques  is 
demonstrated  by  their  implementation  on  flexible  structure  hardware. 

The  original  IFL  modeling  technique,  used  by  Tahk  and  Speyer  for  PRLQG  and 
by  Lin  for  LQG/PRE,  was  restricted  to  a  white  noise  model  for  the  auxiliary  inputs  and 
resulted  only  in  full-order  controllers.  In  this  research,  it  is  shown  that  a  reliance  on 
white  noise  models  may  result  in  relatively  poor  performance.  Frequency-shaped  noise 
iind  multiplicative  white  noise  models  are  developed  to  improve  the  flexibility  of  this 
LQG-based  design  method.  Also,  optimal  order  reduction  is  incorporated  into  this 
framework  for  the  first  time.  The  tremendous  savings  in  controller  duty  cycle  that 
result  are  demonstrated,  suggesting  that  the  addition  of  this  feature  may  be  critical  to 
the  ability  of  these  parameter-robust  controllers  to  be  implemented. 

The  controller  design  equations  for  plant  models  with  multiplicative  white  noise 
have  already  been  derived  for  application  to  a  less  gener2d  and  differently  motivated 
robustness  problem.  However,  derivations  published  up  to  now  have  been  incomplete. 
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frequently  presuming  knowledge  of  stochastic  differential  equations  and  omitting 
nontrivial  steps  and  clarifications  which  fully  explain  the  applicability  of  the  design 
equations.  This  work  attempts  to  fill  that  void  by  suppljring  an  entire  and  unbroken 
derivation,  complete  with  surrounding  discussion  amd  explicit  references  to  readily 
available  sources. 

In  the  implementation  phase  of  the  design,  a  number  of  decisions  must  be  made 
which  affect  the  performance  of  the  control  system.  The  procedures  that  were  used  to 
resolve  these  problems  are  discussed  in  detail,  and  explicit  algorithms  are  developed  for 
the  solution  of  the  coupled  Riccati  and  Lyapunov  equations  of  Chapters  3  and  4. 

The  application  of  the  parameter-robust  and  reduced-order  controller  designs  to 
the  control  of  an  actual  flexible  structure  demonstrate  their  ability  to  solve  real 
problems  and  allow  us  to  quantify  the  performance  of  these  designs.  Saicrifices  in 
performance  due  to  robustness  enhancement  and  controller  reduction  are  measured  2ind 
limitations  are  discovered  as  to  the  maximum  amount  of  controller  reduction  possible. 
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2.  Frequency-Shaped  Noise 


2.1  Problem  Statement 

The  frequency-domain  method  of  auxiliary  input  modeling  attempts  to  replicate 
the  frequency-domain  behavior  of  the  (unknown)  auxiliary  input  signals  by  means  of  a 
frequency-shaped  noise  model.  Rather  than  using  white  noise  (as  does  LQG/PRE)  to 
model  the  auxiliary  input  signals  and  (see  Figure  1.3),  a  much  more  precise 

determination  is  made  of  the  actual  power  spectral  density  of  these  signals.  The 
resulting  auxiliary  noise  model  is  then  combined  with  a  cost  functional  f>enalty  on  the 
auxiliary  output  signals  —  Zf,,  and  z^  —  to  form  a  new  method  of  parameter-robust 
controller  design.  The  noise  modeling  phase  of  this  design  method  consists  of  1)  finding 
an  approximation  for  the  power  spectra  of  the  auxiliary  input  signals,  2)  designing  a 
noise  shaping  filter  to  warp  the  power  spectrum  of  white  noise  into  this  shape,  and  3) 
appending  the  shaping  filter  dynamics  to  the  plant. 

Referring  to  the  LQG/PRE  model  of  the  plant  in  equations  (1.7)  and  Figure  1.3, 
we  see  that  the  actual  power  spectra  of  the  auxiliary  inputs  caimot  be  found,  because 
these  signals  are  a  function  of  the  unknown  diagonal  matrices  L^,  Lb,  and  L^,  and  of  the 
compensator  matrix  triple,  {Ae,Bc,Cc},  which  is  yet  to  be  designed.  However,  a  very 
good  approximation  to  the  power  spectra  may  still  be  found.  For  relatively  small 
perturbations  in  the  A-,  B-,  amd  C-matrices  of  the  plamt,  compared  to  the  nominal 
values,  the  feedback  and  feedforward  loops  containing  the  auxiliary  signals  have  little 
effect  on  the  frequency  content  of  the  states  and  controls,  upon  which  the  auxiliary 
inputs  depend.  Therefore,  we  may  safely  break  these  loops  at  the  auxiliary  inputs  (see 
Figure  2.1)  for  the  purpose  of  approximating  the  power  spectra.  Now,  the  unknown 
diagonal  L-matrices  only  determine  the  sign  and  magnitude  of  the  auxiliary  inputs,  not 
their  fi:equency  content,  and  the  elements  of  the  vectors  w^,  Wb,  and  are  proportional 
to  those  of  Zo,  Zj,  and  z<.,  respectively.  Therefore,  the  task  of  finding  the  power  spectral 
densities  of  the  auxiliary  input  signals  is  equivalent  to  finding  the  power  spectra  of  the 
auxiliary  outputs,  modulo  some  proportionality  constants.  The  values  of  those 
constants  are  not  important,  because  the  relative  noise  intensities  of  the  auxiliary  inputs 
must  later  be  adjusted  in  order  to  provide  the  desired  amount  paraimeter  robustness. 
The  problem  of  not  knowing  the  compensator  matrices  a  priori  can  be  treated  by 
designing  a  standard  LQG  compensator  based  on  the  nominal  parameter  values  of  the 
plant  and  using  it  to  calculate  power  spectra.  Experience  has  shown  that  the  LQG 
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compensator  can  provide  an  excellent  substitute  for  this  purpose.  If  greater  acctiracy 
were  required,  one  could  use  an  iterative  process  of  alternately  designing  a  compensator 
and  approximating  power  spectra  until  a  satisfactory  noise  model  were  found. 


2.2  Noise  Shaping  Filter  Design 

Figure  2.1  illustrates  the  resulting  model  used  to  obtain  approximations  of  the 
power  spectral  densities  of  the  auxiliary  input  signals.  The  corresponding  state-space 
model  of  the  n-th  order  plant,  n-th  order  compensator,  and  auxiliary  inputs  (asstiming 


the  Irmatrices  are  all  identity  matrices)  is  given  by 

x{t)  =  Ax(t)  -I-  Bu(t)  -f-  Giit>(<)  (2.1) 

yit)  =  Cx(t)  +  G2w{t)  (2.2) 

x,(f)  =  >l,x,(<)  +  S,y(t)  (2.3) 

u(f)  =  (2.4) 

w^{t)  =  Niu{t),  Wc{i)  =  N^x{t)  (2.5) 


It  is  assumed  without  loss  of  generality  that  the  exogenous  noise  signal,  w(t),  is  a  vector 
of  mutually  independent,  unit  intensity  white  noise  processes.  Included  in  w{t)  are  both 
the  process  noise  and  sensor  noise.  The  closed-loop  system  with  input  w  and  outputs 
Wa,  lUfc,  and  is  described  by 


2(t)  =  A2(t)-|-Gu;(<) 

«^a(0  =  l^a  0]®(<) 

=  N,C,]x{t) 

«^c(0  =  (-^c  0]l(t) 

where 

G,  ■ 
Bfi, 


x{t)^ 


x{t) 

Xc{i) 


A  BC^ 

BcC  A, 


G  ^ 


(2.6) 

(2.7) 

(2.8) 

(2.9) 

(2.10) 
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From  this  closed-loop  model  and  the  assumptions  on  w,  the  power  spectral 
densities  of  the  elements  of  w^,  w^,  and  may  be  derived  one  at  a  time.  Let  us  say, 
arbitrarily,  that  we  are  interested  in  and  that  it  is  a  scalar.  The  following  procedure 
is  easily  repeated  for  all  other  existing  auxiliary  inputs.  Then  we  have  the  frequency- 
domain  description  of 


^a  =  [N.  0]isI-A)-^Gw  (2.11) 

^  H(s)w 

In  general,  then,  H{s)  is  a  2n-th  order,  multi-input,  single-output  transfer  function 
matrix.  If  the  plant  is  of  sufficiently  low  order,  we  could  stop  here  and  use  H{s)  as  the 
noise  shaping  filter  to  be  appended  to  the  plant.  Normally,  however,  this  would  not  be 
practical,  so  a  lower-order  filter  will  be  found,  which  is  single-input,  single-output.  One 
might  suspect  that  a  low-order  balanced  tnmcation  approximation  to  H{s)  would 
provide  a  good  substitute,  but  examination  of  their  comparative  firequency  responses  for 
particular  examples  has  shown  that  there  is  a  much  more  effective  method  of  designing 
an  accTirate  noise  shaping  filter. 

First  we  find  an  expression  for  the  power  spectral  density,  {u>),  of  w^.  For 

o  o' 

a  vector  exogenous  noise  process  w,  the  definition  of  H{s)  in  (2.11)  leads  to 
(Maciejowski  1989,  p.  98): 

5  (u,)  =  H{u)S^Ju;)H-{u;)  (2.13) 

a  a 

where  is  the  complex  conjugate  trmspose  of  the  trmsfer  fimction  matrix  from  w 

to  w^.  A  suitable  shaping  filter  designed  to  3rield  this  power  spectrum  may  be  found  by 
deriving  its  magnitude  frequency  response  from  the  information  in  (2.13)  and  curve 
fitting  a  transfer  function  to  give  that  response.  The  desired  frequency  response  of  our 
shaping  filter  transfer  function,  is 

(2.14) 
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In  order  to  hold  down  the  controller  order,  the  magnitude  response  in  (2.14)  should  be 
fit  using  a  shaping  filter  of  as  low  order  as  possible. 


2.3  Controller  Design  Summary 

Once  the  shaping  filters  to  produce  auxiliary  inputs  w^,  Wf,,  and  have  been 
foimd  —  say,  fl'a(w),  and  respectively  —  they  may  be  appended  to  the 

plant,  as  shown  in  Figure  2.2.  Define  the  state-space  models  of  the  shaping  filters  by 
the  following  identities. 


(2.15) 

Hi,{u})  =  —  j4„j,)  “  -I-  D^i, 

~  ~  -I- 


Then  the  state-space  model  of  the  plant,  (2.1)  and  (2.2),  is  augmented  so  that  we  have. 
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(2.16) 


The  supplement^^^y  independent  white  noise  variables  Uj,,  and  are  treated  as 
exogenous  noise  sources,  and  there  are  as  many  of  these  additional  noise  sources  in  the 
augmented  model  as  there  are  independent  parameter  uncertainties  in  the  plant. 
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After  augmenting  the  plant  with  the  noise  shaping  filter  dynamics  as  above,  the 
rest  of  the  frequency-domain  method  is  the  same  as  LQG/PRE.  The  intensities  of  the 
auxiliary  inputs  and  the  weights  on  the  auxiliary  outputs  are  adjusted  to  provide  the 
desired  eimount  of  pcirameter  robustness.  Then  a  staindard  LQG  compensator  is 
designed  based  on  the  augmented  model  with  modified  noise  intensities  and  cost 
functional. 
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3.  Multiplicative  White  Noise 


3.1  Problem  Statement 

The  time-domain  method  of  auxiliary  input  modeling  uses  artificial  noise  of 
time-varying  intensity  to  account  for  the  time-varying  amplitude  of  the  actual  auxiliary 
input  signals.  Since  the  amplitude  of  the  true  signals  is  proportional  to  a  known  linear 
combination  of  the  states  or  controls,  a  multiplicative  white  noise  model  provides  the 
desired  proportionate  noise  intensity  as  a  function  of  time.  As  in  Chapter  2,  the 
auxiliary  input  model  is  supplemented  with  a  cost  functional  penalty  on  the  auxiliary 
outputs.  The  time-domain  method  without  the  added  flexibility  of  auxiliary  output 
penalties  is  equivalent  to  the  compensator  design  method  of  Bernstein  and  Hyland  (e.g., 
1988a)  for  multiplicative  white  noise  for  uncertain  systems,  although  the  motivation 
here  is  different.  Since  the  optimal  linear  quadratic  compensator  for  the  resulting  plant 
model  cannot  be  found  by  standard  LQG  techniques,  a  more  lengthy  development  is 
required  here  than  was  necessary  for  the  frequency-domaun  method  of  Chapter  2. 

Modeling  of  the  auxiliary  input  signals  by  multiplicative  white  noise  requires 
modification  of  the  state  and/or  output  equations  of  the  plant.  For  the  general  case  of 
uncertainties  in  the  A-,  B-,  and  C-matrices,  we  have  the  n-th  order  system, 

dxt  =  Axtdt  -t-  X)  -I-  Butdt  -I-  jiBiUtdvn  +  G^d^f  (3.1) 

«■  =  1  i  =  1 

dy,  =  Cxidt  -I-  f;  'fiCiXtdvn  G^d^t  (3-2) 

•  =  1 

These  equations  are  written  in  differential  form,  because  the  development  of  the 
LQ  optimal  compensator  for  systems  with  multiplicative  white  noise  requires  the  use  of 
a  more  rigorous  form  of  stochastic  differentiad  equation  theory.  The  theory  makes 
subtle  distinctions  in  the  interpretation  of  noise  processes,  which  will  be  reviewed  briefly 
in  the  next  section.  Such  precise  definitions  atfe  inconsistent  with  the  use  of  the  concept 
of  Gaussiam  white  noise,  which,  strictly  speadcing,  is  not  well  defined  in  the  more 
rigorous  framework.  Therefore,  all  of  the  noise  vairiables  in  this  chapter  will  be  written 
ais  differentiads  of  Wiener  •processes,  adso  known  ais  Brovmian  motion  processes  and 
loosely  referred  to  as  the  “integral  of  Gaussiam  white  noise”.  Note  adso  that  vairiables 
which  aire  a  function  of  time  aire  identified  ais  such  by  a  “t”  in  the  subscript.  This 
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compact  notation  is  used  throughout  the  chapter  in  order  to  prevent  the  equations  from 
becoming  too  cumbersome. 


The  model  of  equations  (3.1)  and  (3.2)  represents  a  system  with  p  independent 
auxiliary  input  signals,  arising  from  p  independent  parameter  imcertainties.  The  u,, 
(i  =  l,2,...,p)  are  scalar  independent  Wiener  processes,  with  and  is  a 

vector  Wiener  process  with  =  Vdt.  As  an  illustrative  example,  consider  the 

following  1-mode  damped  oscillator  with  acceleration  output  and  two  independent 
auxiliary  input  signals. 


0  0 
0 


\ctdvu  + 


Ft  + 


0 

^2 


hitdv^t  + 


0  0 
1  0 


(3.3) 


dy|=^  —  2Cwja:tdf 0  -1-^  0  1  jd/3^  (3.4) 

An  uncertainty  in  the  damping  parameter,  —  2Cu>,  must  occur  in  both  the  A-  and  C- 
matrices.  Since  these  parameter  uncertainties  are  necessarily  the  same,  they  comprise  a 
single  independent  uncertainty  and  utilize  the  same  noise  process,  dv^.  The  resulting 
auxiliary  inputs  consist  of  (1)  state-dependent  noise  (noise  with  intensity  proportional  to 
the  states  and  entering  at  the  process  noise  port)  and  (2)  measurement-dependent  noise 
(noise  with  intensity  proportional  to  the  states  and  entering  at  the  sensor  noise  port). 
An  uncertainty  in  the  S-matrix  will  generally  be  independent  of  the  d2imping  parameter 
uncertainty  and  is  provided  for  by  an  independent  noise  process,  V2t,  yielding  an 
auxiliary  input  which  is  control-dependent  noise  (noise  with  intensity  proportional  to 
the  input  and  entering  at  the  process  noise  port).  The  positive  scal2Lr  constants  amd 
^2  are  design  pairameters  which  effectively  adjust  the  intensities  of  the  auxiliary  input 
noise  signals  accoraing  to  the  magnitude  of  the  uncertainties  present. 

Given  the  n^-th  order  state-space  model  of  the  compensator, 

dx^t  =  +  B^dyt  (3.5) 

u,  =  C^x^t  (3.6) 

where  n^  =  n  for  a  full-order  controller,  we  can  represent  the  closed-loop  system  as 
follows. 
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(3.7a) 


dx  t  =  Ax  fdt  +  Yif  AiX  fdvit  +  Gd/Sf 

i  =  1 


=  Ax ^dt t  A.^t  ...  j4pZ,  cj 
In  equation  (3.7)  we  have  defined  the  vector  Wiener  process 

T 

dvt=^dvit  dv2t  ...  dwpt] 


dvf 


(3.7b) 


(3.8) 


and  the  closed-loop  quantities,  denoted  by  tildas,  as  follows: 


~  A  , 
n  =  n  -1-  n^, 


A 


^et 


54 


Gi 


a4 


A  BC^ 

BcC  A, 


A  4 

11  — 


Ai  B,C, 
0 


(3.9) 


Now  the  compensator  design  objective  may  be  stated  in  terms  of  an  LQ 
optimization  problem.  Define  the  standard  cost  fimctional, 

J(A„  C J  =  Urn  e|  /[x.'T/Zia:,  -h  2xe’*'iZ,2U,  -h  J  dtj  (3.10) 

In  order  to  simplify  the  problem,  the  cost  functional  will  be  rewritten  in  an  equivadent 
form  without  the  integral. 


J{A„ B„ CA  =  limEjxt^iZjX, -I- 2x,^iJi2«t  +  dt) 

t  ^00 


(3.11) 


[Kwahemaak  and  Sivan  1972,  p.  394,  Theorem  5.4].  In  fact,  the  limit  may  be 
eliminated  as  well,  due  to  the  assumed  stationarity  of  the  exogenous  noise  processes, 
provided  the  initial  time  is  infinitely  far  in  the  past  (i.e.,  for  to-»  — oo). 

In  light  of  the  performance  objective  (3.11),  the  LQ  optimal  compensator  design 
problem  for  systems  with  multiplicative  white  noise  may  be  stated  as  follows.  Given  a 
pltuit  (1.1)-(1.2)  with  p  independent  auxiliary  inputs,  find  a  matrix  triple,  {Ac,Bc,Cc}, 
to  realize  a  compensator  (3.5)-(3.6)  which  minimizes  the  cost  functional  (3.11). 
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3.2  Pieliminaiies  of  Stochastic  Differential  Equation  Theory 

Before  deriving  the  necessary  conditions  for  the  optimization  problem  just 
stated,  a  brief  overview  will  be  given  of  those  elements  of  stochastic  differential 
equation  theory  which  will  be  used  in  the  derivation.  For  more  information  on  matters 
where  specific  references  are  not  given,  see  Jazwinski  (1970)  and  references  therein. 

Consider  the  stochastic  differential  equation, 

dxt  =  f{xt,t)dt  +  G(xt,t)d^t ,  f  >  <0  (3.12) 

where  /3,  is  a  vector  Wiener  process  with  =  Q{t)dt.  Associated  with  this 

equation  is  the  integral  equation, 

-^*0=1  fi^T,  T)dT  +  J  Gix^,  T)dp^  (3.13) 

°  ‘o  ‘o 

The  first  integral  in  (3.13)  can  be  defined  as  a  Riemann  integral  for  sample  functions  of 
X,  or  as  a  mean  square  Riemann  integral  (Jazwinski  1970,  p.  66)  for  x  a  stochastic 
process.  If  G  were  a  function  of  time  only,  as  would  be  the  case  for  ordinary  state 
equations  with  additive  white  noise,  the  second  integral  would  be  known  as  a  Wiener 
integral  However,  added  complications  arise  when  multiplicative  white  noise  is  present 
(G  a  function  of  x).  In  this  more  general  case,  the  second  integral  may  aJso  be  defined 
in  a  mean  square  sense,  but  such  a  definition  will  not  be  unique.  Ito  (1944)  was  the 
first  to  define  this  type  of  stochastic  integral.  In  doing  so  he  modeled  the  Riemann  sum 
after  a  forward  difference  equation,  effectively  sampling  the  integrand  at  the  beginning 
of  each  partition,  and  then  proceeded  with  a  me2m  square  limit  to  define  the  integral. 
When  the  integral  is  interpreted  in  this  sense,  it  is  called  ^m  ltd  stochastic  integrcd,  and 
the  associated  differential  equation  (3.12)  is  called  an  Ko  stochastic  differential  equation, 
or  ltd  equation.  The  rules  of  calculus  which  result  from  this  interpretation  are  called 
ltd  stochastic  calculus.  Note  that  the  closed-loop  state  equation,  (3.7),  can  be 
interpreted  as  an  Ito  equation. 

What  is  not  immediately  apparent  is  that  the  second  integral  in  (3.13)  has 
different  interpretations  according  to  where  the  partitions  of  the  Riemann  sv^n  are 
sampled.  For  example,  2mother  important  interpretation,  called  the  Stratonovich 
stochastic  integral  (Stratonovich  1966),  arises  when  the  differential  equation  is  modeled 


cis  a  central  difference  equation  before  taking  the  limit  (i.e.,  partitions  sampled  in  the 
center).  Actually,  the  differences  between  the  Ito  and  Stratonovich  stochastic  integrals 
are  somewhat  more  profound  than  that,  but  the  mathematical  details  which  distinguish 
the  two  cire  beyond  the  scope  of  this  work.  The  purpose  of  the  foregoing  discussion  is  to 
inform  the  reader  that  the  stochastic  differential  equation  (3.12)  which  appears  in  state- 
space  models  with  multiplicative  white  noise  has  no  meaning  without  assigning  the 
aissociated  integrals  a  particular  interpretation  (e.g.,  Ito  or  Stratonovich  integrals). 

The  Ito  stochastic  integral  is  defined  over  a  much  broader  claiss  of  functions  than 
the  Stratonovich  integral  and  is  used  for  most  theoretical  work  in  stability  and  control. 
Therefore,  Ito  stochastic  calculus  will  be  applied  in  the  development  of  the  next  section. 
The  Stratonovich  integral  does  have  a  number  of  nice  properties,  but  a  simple 
transformation  exists  between  differential  equations  of  the  two  different  interpretations 
(Jazwinski  1970,  pp.  118-20,  131).  It  is  not  clear  which  interpretation  is  more  “correct” 
for  the  application  studied  in  this  work.  However,  Wong  £ind  Zakai  (1965)  discovered  a 
property  of  the  Stratonovich  noise  model  that  gives  it  intuitive  appeal  for  appUcations 
to  physical  systems: 

Theorem  3.1:  Let  x,  be  the  solution  to  the  stochastic  differential  equation  (3.12).  Now, 
replace  the  Wiener  process  with  a  sequence  of  continuous  piecewise  linear 
approximations,  such  that  0^"^  converges  to  as  n-*oo.  Then  the  solutions, 
to  the  resulting  sequence  of  ordinary  differential  equations  converge  to  ij  if 
in  (3.12)  is  interpreted  in  the  Stratonovich  sense.  This  property  does  not  hold  in 
general  for  interpreted  in  the  Ito  sense. 

Because  of  Theorem  3.1  aind  the  fact  that  Bernstein  (e.g.,  1987,  Bernstein  and  Hyl^uld 
1988b)  argues  in  favor  of  the  Stratonovich  interpretation  for  applications  to  lightly 
damped  flexible  structures,  a  comparison  will  be  made  in  Chapter  6  of  the  performance 
of  the  two  different  controllers  that  result  from  the  two  stochastic  integral 
interpretations  when  applied  to  a  particular  example. 

An  important  result  of  Ito  stochastic  calculus,  called  Ito’s  integration  formula, 
will  be  used  in  the  next  section.  It  is  derived  directly  from  Ito ’s  Theorem  below. 

Theorem  3.2  (Ito’s  Theorem  or  Ito’s  Chain  Rule)  (e.g.,  Jazwinski  1970,  p.  112,  Lenuna 
4.2)  Also,  for  a  more  formal  proof,  see  Gikhman  and  Skorokhod  (1969,  pp.  387-91): 
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Let  be  a  real,  scalar  linear  functional  with  continuous  partial  derivatives, 


,  A  d<i> 


,  A 


(3.14) 


where  is  the  unique  solution  to  the  vector  ltd  stochastic  di£ferenti2Ll  equation, 
(3.12),  and  is  a  vector  Wiener  process  with  E{d/9,d/3t^}  =  Q{t)dt.  Then  the 
stochastic  differential  d<t>  of  (f>  is 


d<t>  =  ^fdt  +  <f>x^dx,  +  5  trace{GQG^^„}dt  (3.15) 

The  existence  of  the  third  term  in  (3.15)  establishes  the  fact  that  the  usual  chain  rule 
does  not  hold  in  ltd  stochastic  calculus,  although  it  does  hold,  incidentally,  in 
Stratonovich  stochastic  calculus. 

Integrating  (3.15)  and  taking  the  expected  value  conditioned  on  x^  =  Xq,  Ito^s 
integration  formula  results  (Wonham  1970,  p.  137): 

E{^(x„<)  I  x,^  =  Xo} 

=  („)  +  e|  J  trace{GQG’-^„)]  rfr  |  x,^  =  icj  (3.16) 

The  argxunents,  (x,.,r),  have  been  omitted  for  functions  in  the  integrand  in  order  to 
conserve  space.  The  term  in  (3.15)  involving  d^t  (by  way  of  dif)  does  not  appear  in 
(3.16)  due  to  the  Martingale  property  of  a  stochastic  integral  w.r.t.  a  Wiener  process. 
With  this  formula,  we  are  now  ready  to  find  an  expression  for  Q,. 


3.3  Conversion  to  Deterministic  Minimization  Problem 

The  first  step  in  deriving  the  necessary  conditions  will  be  to  simplify  the 
optimization  problem  stated  at  the  end  of  section  3.1  by  converting  it  into  a 
deterministic  minimization  problem.  Define  the  closed-loop  state  covairiance, 

0,*E{x,i/),  (3.17) 

and  the  closed-loop  state  weighting  matrix. 


Rl  Ri2^c 

CJR,C, 


(3.18) 


Then  the  cost  functional  of  (3.11)  becomes, 


(3.19a) 


=  trace{Qfl} 


(3.19b) 


The  objective  then  is  to  minimize  the  cost  as  described  by  (3.19b).  The  quantity  R  is 
known,  because  it  is  specified  by  the  cost  criterion  in  the  problem  statement.  It 
remains,  however,  to  find  an  expression  for  the  closed-loop  state  covariance,  Q. 

Define  the  functional,  (f>ijix »)  =  x  |,x  where  x  denotes  the  i***  element  of  the 
solution  vector  x ,  to  the  closed-loop  state  equation  (3.7).  Note  that 
Qtihj)  =  E{^,-j(x  <)},  The  /,  G,  and  Q  quantities  of  (3.7)  ^^s  defined  by  the  generic  Ito 
equation,  (3.12),  are  given  by, 

/(*,)  =  Xi„  g(i,)=[3,s,  3,1,  ...  a,*,  g], 


g  =  diag{7i,72,...,7p,V} 


(3.20) 


The  pcirtial  derivatives  needed  for  Ito’s  integration  formula  aure  computed  as  follows. 


<f>t  =  0. 


(3.21) 


0  X,,  0...0  x„  0...0],  i^j 
[0...0  2x„  0...0],  i  =  j 


(3.22a) 


(3.22b) 


where  Ejj  denotes  the  elementary  matrix  with  *  in  the  (i,  position  and  O’s  elsewhere. 
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(0  (i) 


"■{ 


H) 

U) 


(0 


0  .  0 

:  1  : 
;  1  ; 

0  .  0 

0  .  0 

;  2  i 

0  .  0 

(0 


I  j 


t  =  j 


(3.23a) 


(3.23b) 


Therefore,  tbe  second  and  third  terms  of  the  integrand  of  Ito's  integration  formula  are 


^  —  x  (i(Ax  ,)j-  +  X  tj{Ax  t),- 

=  (:4x.xJ),,  +  (A2.xJ).. 

=  (x.xTA'r),.  +  (Ax.xJ),^  (3.24) 

and 

trace{G<}G'''^3; )  =  trace|[A,2 ,  3ji,  ...  A,i,  8]  diag{7„7j,  ...  ,7,,V} 

[a,x,  A,x,  ...  A^.  e]V,  +  £yi)| 

=  taac«{[  J  J.A»3!  ^/A/  +  8 veT  (B,j  +  £,,)} 

=  2  1^£  hkAj*  ,x  TA/)y  +  (3.25) 

where  V  ^  GVG^.  In  the  last  step  of  (3.25),  the  identity 
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(3.26) 


trace{A„x„B„xn)=  Z  II 

•  =  1  i  =  1 

was  used,  2Llong  with  the  s}nnmetry  of  the  expression  enclosed  in  brackets. 

Now  we  may  apply  Ito’s  integration  formula  to  find  an  expression  for  the  closed- 
loop  state  covariance,  Q^. 

E(3i  y  I  S ,  =  I  „}  =  i ,  x  ^  +  e{/1(x  ^  +  (Ax  ^  /)y 

*0 

+ J_(7»A»x  ^  I X  ,^  =  X  „}  (3.27) 

=  §..(■, i)  +  /((«4'')y  +  (AQ,h 

‘o 

,  +t{'1kA,QrAu^h  +  (3-28) 

Qt  =  i[QrA-^  +  AQ,+  t  iMX^  +  V]dT  (3.29) 

°  *0  •  =  1 

By  the  Fundamental  Theorem  of  Integral  Calculus, 

Q,  =  AQ,^-Q,A^+E  lAfl.A;’  +  V  (3.30) 

•  =  1 

The  cost  functional,  (3.19b),  requires  the  steady  state  solution  to  (3.30),  but  first 
we  must  determine  under  what  conditions  such  a  solution  exists.  To  that  end,  we  will 
consider  an  alternate  expression  for  Qt  which  is  useful  for  examining  the  stochastic 
stability  of  the  system.  The  following  lemma  will  be  used  repeatedly. 

Lemma  S.l:  Given  matrices  A,  B,  and  C  of  compatible  dimensions, 

vec(ABC)  =  0  A)vec(B)  (3.31) 

Proof:  See  Graham  (1981,  p.  25,  Property  VIII). 

In  Lemma  3.1,  the  “vec”  operator  merely  organizes  the  columns  of  its  matrix  argument 
into  a  single  column  vector  with  the  first  column  on  top  and  each  subsequent  column 
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beneath  the  previous  one.  The  symbol  “®”  represents  the  Kronecker  product.  Now 
define 

U  ^  A  ®  ^  +  f:  7.A,  ®  (3.32) 

where  “e”  is  the  Kronecker  sum.  Then  taking  the  “vec”  operation  of  (3.30)  and 
applying  Lemma  3.1  to  the  first  three  terms  on  the  right  hand  side,  we  obtain  the 
alternate  expression, 


vec(Qt)  =  A  vec(Q,)  +  vec(V) 


(3.33) 


From  this  equation  we  see  that  when  .X  is  a  stable  matrix,  a  steady  state 
solution,  Q,  exists.  Under  this  condition  (i.e.,  stable  X.)  the  closed-loop  system  (3.7)  is 
said  to  be  second-moment  stable  (Arnold  1974),  and  Q  is  the  solution  to  the  algebraic 
equation. 


0  =  AQ-\-QA^+Y,-fiAiQAi^-\-V  (3.34) 

t  =  1 

which  is  a  modified  Lyapunov  equation.  Note  that  when  the  plant  imcertainties  are 
zero  (i.e.,  7^  =  0  or  =  0,  for  all  i  =  l,...,p)  the  third  term  in  (3.34)  vanishes  and  the 
standard  Lyapimov  equation  results  for  the  steady-state  second-moment  matrix  of  the 
state  vector  for  a  system  driven  by  white  noise  [see  e.g.,  Kwakemaak  and  Sivan  (1972, 
p.lOl,  Theorem  1.52)].  In  order  to  assure  that  the  cost  ftmctionad,  J,  is  finite  and 
independent  of  the  system  initial  values,  only  second-moment  stabilizing  compensators 
will  be  admissible. 

Therefore  the  original  stochastic  optimization  problem  has  been  reduced  to  the 
following  deterministic  one: 

(3.35) 


trace(QS) 

minimize: 

over  the  set: 

{(Ae,Bc,Cc):  A  is  stable} 

subject  to: 

0  =  AQ-\-  QA^  -1-  Z  yiAiQA,^  +  V 

i  =  1 
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3.4  Derivation  of  the  First-Order  Necessary  Conditions 

The  first-order  necessary  conditions  that  a  controller  triple,  must 

meet  to  solve  this  type  of  problem  have  been  derived  by  at  least  two  different  methods 
—  the  Lagrange  multiplier  method  (Bernstein  and  Haddad,  1989),  used  to  solve  an 
problem,  and  the  calculus  of  variations  method  (Bernstein  and  Hyland,  1988a).  The 
Lagrange  multiplier  solution  is  somewhat  easier  to  follow  but  could  not  be  foimd  in 
complete  form  in  the  literature  for  this  particular  problem.  Therefore,  the  bulk  of  it 
will  be  derived  here,  leading  up  to  the  simple  but  more  tedious  algebraic  manipulations, 
which  will  be  referenced  to  a  paper  where  they  are  found  in  their  entirety. 

The  Lagrangian  corresponding  to  the  optimization  problem  (3.35),  can  be  written 
as, 

B„C„Q,  P,  A)  =  tr{Agfi  +  +  §  A''  +  £  +  V)?)  (3.36) 

t  =  1 

where  P  is  an  n  x  n  matrix  of  Lagrange  multipliers  and  A  is  the  scalar  supplementary 
Lagrange  multiplier,  which  is  without  loss  of  generality  equal  to  1  if  the  problem  is 
normal  (i.e.,  A  =  0=»P=0)  —  see,  for  example,  Ewing  (1985,  sec.  5.5).  Since  the 
conventional  Lagrange  multiplier  problem  involves  a  vector  of  constraints  and  hence  a 
vector  of  Lagrange  multipliers,  the  use  of  a  Lagrange  multiplier  matrix  requires  some 
justification.  The  conventioneil  problem  with  equality  constraints  takes  the  form, 

minimize:  f{x,y)  (3.37) 

subject  to:  <7(1)  =  0 

where  /  is  a  scalar  function  of  the  vector  variables  x  and  y,  and  y  is  a  vector  function  of 
X.  Then  the  Lagrangian  to  be  minimized  becomes, 

JL(i,  y,  p,  X)  =  Xf  +  p^g  (3.38) 

with  p  a  column  vector  of  Lagramge  multipliers  with  dimensions  equal  to  those  of  x. 
The  matrix  equaility  constraint  in  (3.35)  may  be  rewritten  in  the  form  (3.38)  by 
applying  the  “vec”  operator  to  the  equation.  Define  the  matrix  expression  on  the  right 
hand  side  of  the  equality  constradnt  in  (3.35)  ais  X.  Then  am  equivalent  to  the 
constraint  JV  =  0  is  vec(X)  =  0,  which  yields  the  Lagrangian, 
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JL(A„B„C„g,p,A)  =  A  tT{QR}  +  p-^veciX) 


(3.39) 


where  p  represents-, an  n  *  x  1  Lagrange  multiplier  vector.  Of  course,  (3.39)  is  equivalent 
to  (3.36).  Applying  the  identity,  (vecA^)^vec(B)  =  tr(i4B)  (Graham  1981,  p.  18, 
Example  1.4),  to  the  term  involving  p,  we  see  that  (3.39)  may  be  written  as 

1(A„ B„C„Q,P,A)  =  A  tT{QR}  +  tr{P'^X}  (3.40) 


where  the  n  x  n  Lagrange  multiplier  matrix  P  is  defined  by  p  =  vec(P).  Clearly,  (3.40) 
is  equivalent  to  (3.36).  Note  that  the  symmetry  of  the  expression  defined  as  X  means 
that  we  may  assume  without  loss  of  generality  that  P  is  S3rmmetric. 

In  taking  the  partial  derivatives  of  the  Lagrangian,  the  following  profterties  will 
be  used  (Graham  1981,  pp.  76-78,  Examples  5.4-.'5.6), 

(3.41) 


Taking  the  partial  derivatives  of  the  Lagrangian,  (3.36),  with  respect  to  each  of  its 
arguments  and  setting  each  equal  to  zero: 


m 


XR  +  A^p  +  PA  +  E  liA^PAi  =  0 

•  =  i 


(3.42) 


In  order  to  test  whether  the  optimization  problem  is  normal,  we  set  A  =  0  in  (3.42). 
Taking  the  “vec”  operation  of  the  equation,  as  well,  the  result  is 


vecP  =  0 


Jj  vecP  =  0 


(3.43) 


Since  J.  is  stable  by  assumption,  has  no  nullspace,  and  therefore  P  =  0.  As 
stated  above,  A  =  0=^P  =  0  means  that  the  problem  is  normal,  and  we  can  take 
A  =  1,  without  loss  of  generality.  Therefore,  with  A  =  1, 
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(3.44) 


P  'V 


R  +  A^P  +  PA  +  E  =  0 

•  =  1 


Before  proceeding,  partition  the  symmetric  matrices,  V,  Q,  and  P,  into  2x2 
blocks  (of  dimensions  n  x  n,  n  x  n,.,  x  n,  and  x  n^): 


,  0  = 


Ql  Qi2 
Qi2^  Q2 


P4 


Pi  P12 

1 

12 


p  T  p 

-*19  -*2 


(3.45) 


Now  the  Lagrangian  may  be  expanded: 

1(A„  B„C„Q,  P)  =  trW.i?,  +  +  Qii'^R^C,  +  Q-fijRfi,  (3.46) 

+  (A(3,  +  +  (^0i2  +  BCMP,,'^ 

+  (B,CQ,  +  A,Q,^-^)P„  +  {B,CQ,,  +  A^P, 

+  (Q,A'^  +  +  e.jA  J)P„-^ 

+  +  <? .C/B'OP,  J  +  (Qn'^C^Bj  +  Q,A/)P, 

+  £  'ril(AiQ,V  +  BA<?.2'".4i’'  + AiO.jCjB,’- 

+  B,<7,0,C7B.’')P, 

+  (AiQ.Ci-^B/  +  BA«n’'Ci’'B,T)P,  T 

+  (Bfifl,Ay  +  B,CiQ.,C7B,’^)P„ 
4-(B.C,Q,C/B7)P,1 


+  V.P.  +  V„BJP„T  +  ^  B.VjBjPj} 


This  allows  us  to  take  the  partial  derivatives  of  the  Lagraingian  with  respect  to  the 
compensator  parameters. 


ax  _n . 

gAi-"- 


P \2^Qi2  +  P  2Q2  +  P  12^Qi2  +  P  2Q2  —  0 


(3.47) 


P  12^^12  +  P  2Q2  —  0 


(3.48) 
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01^  =  0: 


(3.49) 


■  “  ‘  +  Pxx^BfijQx'Tcy + 2P ACiOiO 

+  +  2P,S.Vj  =  0 

(^.■"0,  +  PAx'^  +  +  P  AV,  (3.50) 

+  £  7i(Pn’'A,Q,  +  Pxx'^B.C.Qxx'^  +  P,B,CxQ,)Cx'^  =  0 

i  =  1 

If  for  each  i,  B,-  and  Me  not  both  nonzero  (i.e.,  the  Wiener  processes  used  for 
control  and  measurement  dependent  noise  are  independent),  a  closed  form 
expression  for  can  be  fotmd.  For  then  we  have, 

(3.51) 

which  leads  to, 

P.=  -Pj''I(Pl!'"<?l  +  P2<?.!’')<^  +  Pl2'^(''l2+£7.A0lCi’')lVr'  (3.52) 

1  =  1 

where 

Vj^Vi  +  £7,CiO,c,'" 


(3.53) 


(3.54) 


■^12^^12  "t"  ■^12^Qx2  "I"  ^^2^cQ2 


+  B’^P.Q,,  +  B'^P,,Q,  +  B-^P.Q,,  +  B^P,,Q, 

+  E  'fiiBi'^PxAiQr2  +  B,'^P,A,Q,,  +  2B,^P,Bfi,Q, 

1  =  1 

+  Byp,^B,C,Q,,  +  Byp,,Bfi,Q,,)  =  0 

Rx2^Qx2  +  R2CCQ2  +  B'"(PiQi2  +  ^*12^2)  (3-55) 

+  E  liBi^{P l-^tQn  +  P lBficQ2  +  P 12-® 0^1^12)  ~  ® 

i  =  1 


Assuming  as  before  that  for  each  i,  J5,-  and  C,-  are  not  both  nonzero, 


(3.56) 


Then  we  find  that 

C,=  -Rr  W{PiQi2  +  Pi2Q2)  +  (i2i2^  +  E  liBi'^PxAM2]Q2  -  ^  (3.57) 

i  =  1 

where 


Rn'^Qi2  +  R2CCQ2  +  B^{P,Q,,  +  PM 

+  E  7.5,'^(Pi>1.^?,2  +  PiBiCM  =  0 

t  =  1 


R2  =  R2+f:iiBP^PiBi  (3.58) 

« =  1 

The  equality  constraint  in  (3.35)  ^lnd  the  framed  equations  —  (3.44),  (3.48), 
(3.51),  and  (3.56)  —  are  equivalent  to  equations  (20),  (71),  (74),  (75),  and  (76), 
respectively,  in  (Bernstein  and  Hyland  1988a).  There,  the  derivation  of  the  first-order 
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necessary  conditions  is  completed.  Essentially,  it  involves:  (1)  proving  the  existence  of 
the  inverses  in  (3.52)  and  (3.57);  (2)  expanding  both  (3.44)  and  the  equality  constraint 
in  (3.35)  into  four  blocks;  (3)  substituting  the  expressions  for  and  into  those 
expanded  equations;  and  (4)  using  algebraic  manipulations  to  solve  for  an  expression  for 
and  to  find  equations  for  the  unknowns,  Q  and  R,  independent  of  A^.,  and  C^. 
The  necessary  conditions  in  final  form  are: 


(3.59) 


(3.60) 


(3.61) 

(3.62) 


where 


Q  d  gcT  +  V„  +  f  7^.4 j((?  +  Q)CJ 

i  =  1 

(3.63) 

^  +  E  liBi'^iP  +  P)Ai 

•  si 

(3.64) 

=  Ap^  A-BR2-^'3> 

(3.65) 

and  Ac,  B^,  and  are  r 


Hen  in  terms  of  the  unknown  variables  Q,  P,  Q,  and  P  as: 
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(3.66) 


(3.67) 

(3.68) 


3.5  Modifications  for  Stratonovich  Noise  Model 

As  mentioned  in  section  3.2,  there  is  a  simple  transformation  which  allows  us  to 
reinterpret  the  Ito  equation  (3.7b)  —  that  is,  our  state  equation  —  in  the  sense  of 
Stratonovich.  That  transformation  is  given  by  the  following  theorem. 


Theorem  S.S:  Given  the  vector  Stratonovich  equation, 

dxt  =  f{xt,i)dt  +  G{Xf,t)d^f ,  t>to  (3.69) 

where  Xj  has  dimensions  n  x  1  and  is  a  vector  Wiener  process  with 
E{d0fd0t'^}  =  Q{t)dt,  the  equivalent  Ito  equation  for  the  element  of  dx^  is 


dxit  =  |/.(x„  t)  +  ^  |^G'(x„  t)dPt]i ,  t>  to  (3.70) 


This  theorem  is  given  in  Jazwinski  (1970,  p.  131)  and  is  formally  proved  by 
Stratonovich  (1966).  It  shows  that  a  correction  term  must  be  added  to  the  first  term  in 
the  state  equations.  Comparing  (3.69)  with  (3.7b), 

dx,  =  4-[AjX,  A2X1  ...  Api,  G 

and  recalling  equations  (3.20), 

/(x,)  =  3x„  g(x,)=[:5,*,  1,x,  ...  3,x,  gJ 

(?  =  diag{7„72,...,7p,y}  (3.20) 


dvf 


(3.7b) 
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we  can  substitute  the  quantities  from  (3.20)  into  (3.70)  and  show  that  in  order  to  give 
the  multiplicative  white  noise  model  a  Stratonovich  interpretation,  we  need  to  make  the 
following  substitution, 

Ax,  Ax,  +  i  (3.71) 

Eliminating  the  common  factor,  x„  and  expanding  A  and  A,  into  2x2  block  form 
[see  (3.9)],  we  have 


A  RC, 

A 

BC,' 

1 

A.B.C, 

BJC  A, 

♦- 

K 

+5 

R,C.A, 

B,CiBA 

(3.72) 


Then  employing  once  again  the  assumption  that  for  each  t,  Bi  and  C,-  are  not  both 
nonzero,  and  examining  the  (1,1),  (1,2),  and  (2,1)  blocks  of  (3.72),  we  find  that  the 
Stratonovich  noise  interpretation  may  be  accomplished  by  means  of  the  three  simple 
substitutions  in  the  plant  model. 


A  A  +  A  E  liAi*  , 

^  .  =  1 

B  B  +  \  £  'liAiBi , 

^  i  =  l 

C  C  +  5  E 

^  i  =  l 


(3.73) 


3.6  Controller  Design  Summary 

The  multiplicative  white  noise  model  of  this  chapter  is  used  for  the  auxiliary 
inputs  of  the  LQG/PRE  error  model.  This  auxiliary  input  model  is  combined  with  the 
standard  cost  functional  penalties  on  the  auxiliary  outputs  to  produce  a  new  parameter- 
robust  design  procedure.  The  performance/robustness  tradeoff  is  accomplished  similarly 
to  LQG/PRE,  where  the  noise  intensities  are  aidjusted  by  means  of  the  scalar 
parameters  7,  (t  =  l,2,.,.,p),  and  auxiliary  output  penalties  are  adjusted  as  before,  by 
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modifying  the  state  and  control  weighting  matrices  {R^  and  iZj,  respectively)  as  in 
equation  (1.9)  and  substituting  these  modified  matrices  into  the  controller  design 
equations  developed  in  this  chapter. 
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4.  Optimal  Reduced-Order  Control 


One  of  the  great  advantages  of  the  parameter  robust  design  techniques  discussed 
in  Chapters  2  and  3  (as  well  as  LQG/PRE)  over  p-synthesis  (see  section  l.lB)  is  the 
fact  that  they  lend  themselves  to  application  of  a  method  of  optimal  reduced-order 
controller  design,  called  the  optimal  projection  equations  (Hyland  and  Bernstein  1984). 
As  is  well-known,  the  LQG-optimal  compensator  for  an  n^^-order  plant  model  is  also  of 
order  n.  However,  the  optimal  projection  equations  allow  us  to  specify  a  compensator 
order  <  n  and  directly  design  the  optimal  compensator  of  that  order,  provided  a 
stabilizing  nj‘*‘-order  compensator  exists.  The  optimal  projection  results  will  be  stated 
here  directly.  For  a  proof,  see  Hyland  and  Bernstein  (1984)  [for  the  case  of  no  cross¬ 
weighting  in  the  cost  and  no  cross-covariance  between  process  and  sensor  noise],  or  see 
Bernstein  and  Hyland  (1988a)  for  the  more  general  case. 

Given  the  n*‘*-order  plant, 

i{t)  =  Ax{t)  -f  Bu{t)  +  Giw{t)  (4.1) 

y{t)  =  Cx{t)  +  Gjwit) 

with  uncorrelated,  unit  intensity,  Gaussian  white  noise  vector,  w,  define  the  covariance 
matrices, 

V.^G.G/,  V„^G,G,\  V,^G,G/  (4.2) 

and  the  linear-quadratic  cost  functional, 

J{Ac,Bc,Cc)  =  limE{i|^i2iX,  -|-  2x,^i2i2ne  +  dt)  (4.3) 

T  *00 

If  a  stabilizing  nj.***-order  compensator, 

^c(0  =  ^c^e(0  +  (4-4) 

u(f)  =  CcX^(t) 

exists,  the  one  which  minimizes  (4.3)  is  given  by  the  design  equations, 
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(4.5) 


A,  =  r(A  -  BRj  '^9-  CIV2  -  ^C)G'^ 

B,  =  rQVj-^ 

C, =  -R^-^^G'^ 

where 

d^QC^  +  Vj,,  9^B’^P  +  Rj^'^  (4.6) 

Also  define 

(3.65) 

Then  the  n  x  n  matrices  Q,  P,  Q,  P,  and  r  are  the  solution  to  the  optimal  projection 
equations, 

(4.7) 

(4.8) 

(4.9) 

(4.10) 

(4.11) 


0  =  AQ  +  QA'^-O.V2~^d'^  +  Vi  +  T^dV2-^d'^Tj_'^ 
0  =  A'^P  +  PA  -  +  r  j.  X 

0  =  ApQ  +  QAp"^  +  QVj  -  -  r  j.  QKg  ■  X 

0  =  Aq^P  +  PAq  +  g^'^Pj ~  -  T  j.  VR2  -  j_ 

rank(Q)  =  rank(P)  =  rank(QP)  = 


where  r  jl  ^  I  —  t  and  the  optimal  projection  matrix,  t,  is  perhaps  most  simply 
described  as  follows.  Define  a  balancing  transformation  '9  (Laub  1980)  that 

A  A 

simultaneously  diagonalizes  Q  and  P  by. 


-  ig^r  - 1  =  =  A  (4.12) 

A  —  diag(A],A2,. . .,A,,  ,0,. ..,0} 

C 
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Then 


Also, 


T  = 


$-1 


(4.13) 


G  =  [/„^  0]’®'"  and  r  =  [/„^  0]«-^ 


(4.14) 


Note  that  the  modified  Riccati  equations  (4.7)-(4.8)  and  the  modified  Lyapunov 
equations  (4.9)-(4.10)  are  coupled  together  by  the  matrix  t  .  For  the  case  of  a  full- 
order  compensator  (n,.  =  n),  we  have  t  =  G  =  F  =  /  and  t  j_  =  0,  so  the  terms  involving 
r  ^  disappear.  In  that  case  (4.7)  and  (4.8)  become  the  standard  observer  and  regulator 
Riccati  equations  of  LQG  control,  and  equations  (4.5)  become  the  standard  state-space 
solution  for  the  controller.  Also,  in  the  full-order  case,  the  Lyapunov  equations  (4.9) 
and  (4.10)  become  superfluous.  Notably,  these  equations  must  be  solved  for  the 
observability  gramian,  Q,  and  the  controllability  gramian,  P,  if  one  wishes  to  balance 
and  truncate  the  plant  to  produce  a  suboptimal  reduced-order  controller.  The  coupled 
structure  of  the  optimal  projection  equations  shows  that,  in  a  sense,  the  balancing  and 
controller  design  must  be  carried  out  simultaneously  in  order  to  preserve  optimality.  In 
the  general  case,  where  possibly  n^.  <  n,  Q  and  P  are  referred  to  in  the  literature  as  the 
observability  and  controllability  pseudogramians. 

Clearly,  the  optimal  reduced-order  design  equations  apply  to  LQG/PRE,  since 
LQG/PRE  differs  from  standard  LQG  only  in  the  fact  that  the  values  of  V2,  Ri, 
and  R2  are  modified,  as  in  (1.9).  Optimal  projection  may  also  easily  be  applied  to  the 
firequency-domain  method  of  Chapter  2.  The  auxiliary  input  modeling  phase  of  that 
method  merely  involves  augmentation  of  the  plant  dynamics  and  augmentation  of  the 
covariance  matrices  Fj  and  F2  to  account  for  additional  noise  sources.  The  auxiliary 
output  modeling  phase  only  modifies  the  weighting  matrices  and  iJj.  Therefore,  the 
modified  model  of  the  plzuit,  the  modified  noise  model,  and  the  modified  cost  criterion 
still  constitute  a  standard  LQG  problem.  That  being  the  case,  the  optimal  projection 
equations  may  be  applied  directly. 

The  time-domain  method  of  Chapter  3,  however,  does  not  conform  to  the 
standard  LQG  framework,  because  the  multiplicative  white  noise  model  of  the  auxiliary 
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inputs  requires  a  change  in  the  structure  of  the  state  and/or  output  equations  of  the 
plaint.  Fortunately,  optimal  projection  may  still  be  applied.  The  optimal  projection 
equations,  as  stated  above,  may  be  derived  by  the  Lagrange  multiplier  method  of 
section  3.4  (Hyland  and  Bernstein  1984).  More  specifically,  the  ramk  conditions  (4.11) 
are  enforced  in  the  course  of  the  algebraic  manipulations  which  follow  after  taking  the 
partial  derivatives  of  the  Lagrangian.  In  fact,  the  same  rank  conditions  may  be  applied 
to  the  more  general  multiplicative  white  noise  problem  (Bernstein  and  Hyland  1988a). 
These  rank  conditions  did  not  appear  explicitly  in  Chapter  3,  because  the  algebraic 
manipulations  were  referenced  to  the  paper  just  mentioned.  Abo,  Chapter  3  was 
concerned  only  with  the  case  of  a  full-order  controller.  Of  course,  the  full-order  case  is 
still  important,  since  a  restriction,  <  n,  on  the  controller  order  generally  results  in  an 
increase  in  the  value  of  the  cost  functional,  J,  and  therefore  brings  about  a  degradation 
in  performance. 

By  including  multiplicative  white  noise  in  the  state  and  output  equations  of  the 
plant,  as  in  Chapter  3, 

dxt  =  Axtdi  +  f)  +  Butdt  -f  ^  7,B,u,dv<,  -b  Gid0t  (3.1) 

«  as  1  1  =  1 

dyt  =  Cx^dt  -1-  E  'liCiX.dvit  -I-  (3.2) 

i  =  l 

we  obtain  a  general  model  which  applies  to  all  of  the  parameter-robust  controller  design 
methods  discussed  in  this  work.  For  a  method  in  which  multiplicative  white  noise  is 
not  desired,  we  may  set  A,-  =  B,-  =  Cj  =  0  in  (3.1)-(3.2)  to  obtain  (4.1).  The  optimal 
n<.‘*’-order  compensator  (for  n<.  <  n),  if  a  stabilizing  compensator  of  that  order  exists,  is 
given  by 


A,  =  r(A-BB2-*^-QV^2’'C')G'^  (4.15) 

where  P,  Q,  P,  and  t  ^  are  the  solution  to  the  more  general  optimal  projection 
equations. 
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(4.16) 

(4.17) 

(4.18) 

(4.19) 

(4.20) 

The  X  n  matrices  T  and  (7,  and  the  n  x  n  matrix  r  j,  are  determined  by  Q  and  P  as 
described  above,  and  the  following  definitions  fiom  Chapter  3  apply. 


•=i 

(3.53) 

I  si 

(3.58) 

Q  ^  QCT  +  V,,  +  f:  ^,A,(Q  +  Q)Cy 

«  S  1 

(3.63) 

9  ^  B'^'P  +  +  t  +  P)Ai 

i  =  l 

(3.64) 

Aq^A-OVj-^C, 

(3.65) 

o^AQ+QA'^-aVi  ‘a'^+y,+Tj,ap2 

i  s  1 

0  =  A7p  +  PA  -  “ ‘9  +  «,  +  T  J,  '  V  J. 

+  £  7.{Ai^PAi  +  (Aj  -  QVj  ■  'Cj)-^P( A,.  -  Cli7j  '  'C;)) 

i  s  1 

0  =  ApQ  +  QAp'^  +  QV2~  -  T  j.QV2~ 

0  =  +  PAg  +  -  T  j.  Vfl2  "  X 

rank(Q)  =  rank(P)  =  raiik(QP)  = 


The  numerical  solution  of  the  coupled  Riccati-  and  Lyapimov-type  equations  in 
this  and  the  previous  chapter  require  an  iterative  algorithm.  The  algorithms  used  to 
solve  these  equations  will  be  discussed  in  Chapter  5. 
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5.  Methods  and  Algorithms  for  Controller  Design 


Chapters  2,  3,  and  4  described  the  basic  theory  needed  for  the  controller  design 
techniques  under  study.  This  chapter  is  concerned  with  the  practical  problems  involved 
in  applying  the  design  methods  to  an  actual  control  system.  In  the  sections  that  follow, 
a  few  of  the  more  important  implementation  considerations  are  discussed  which  arose 
over  the  course  of  this  research.  Solutions  are  given  to  these  problems,  and  algorithms 
are  described  in  detail. 


5.1  Selection  of  Cost  Functional  Weighting  Matrices  and  Covariance  Matrices 

When  relatively  little  is  known  about  the  physical  meaning  of  the  d}rnamics  of  a 
control  system’s  state-space  model,  it  is  common  practice  to  oversimplify  the  cost  and 
covariance  matrices  for  LQG  controller  designs.  One  method  is  to  assign  scalar- 
weighted  identity  matrices,  as  follows: 

Ri  =  /,  ■^12  —  0 

E{uu^}  =  /,  E{nn^}  =  E{im^}  =  0 

where  v  and  n  represent  the  process  and  sensor  noise  vectors,  respectively.  The  idea, 
then,  is  to  reduce  the  LQG  design  process  to  the  selection  of  two  scalar  parameters  — 
Pf  to  adjust  controller  authority,  and  p„  to  adjust  the  tracking  speed  of  the  Kalman 
filter.  Although  it  is  generally  not  possible  to  know  the  cost  matrices  which  will  give 
the  most  desirable  response  or  the  covariance  matrices  which  most  accurately  describe 
the  noise,  we  may  take  advantage  of  what  knowledge  we  have  about  the  simply 
supported  plate  under  study  to  take  a  more  meaningful  design  approach. 

A  large  cliiss  of  damped  flexible  structures,  including  those  modeled  by  finite- 
element  methods,  can  be  expressed  in  spatial  coordinates  by  the  following  differential 
equation: 


Mq{t)  +  cm  +  Kqit)  =  m 


(5.2) 


where  q  denotes  the  n-dimensional  spatial  displacement  vector  and  /  denotes  the 
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n-dimensional  applied  force  vector.  M,  C,  and  K  are  S3anmetiic  mass,  damping,  and 
stiffness  matrices,  respectively.  Assuming  the  structure  has  proportional  damping  (i.e.-, 
C  =  k^M  +  k^K,  for  some  scsdar  constants  k^^^  and  kf),  the  matrices  M,  C,  and  K  can  be 
simultaneously  diagonalized  by  a  left  and  right  multiplication.  In  other  words,  the 
system  may  be  decoupled  into  modal  coordinates  by  means  of  a  variable  substitution,  so 
that 


mfiiit)  +  CiXi(t)  +  kiXi(i)  =  fi(i),  i  =  1, 2, . . .,  n  (5.3) 

where  represents  the  “displacement”  of  the  mode.  The  proportional  damping 
jissumption  tends  to  hold,  for  all  practical  purposes,  when  the  damping  is  very  light,  as 
is  the  case  for  many  flexible  structures  and  in  particular  for  the  one  studied  in  this 
work.  Also,  the  identification  procedure  used  to  obtain  a  model  of  the  simply  supported 
plate  assumes  a  modal  system.  In  terms  of  modal  natural  frequencies  and  deunping 
ratios,  a;,-  and  C,-,  respectively,  an  equivalent  expression  for  (5.3)  is 

®i(<)  +  2C.w<ii(<)  +  Wi*Xi(t)  =  Ui(<),  t  =  1, 2, . . .,  n  (5.4) 

Then  the  modal  frequencies  are  expressed  in  terms  of  the  diagonalized  mass  and 
stifeiess  matrices  by 


(5.5) 


The  energy  contained  in  each  mode  is  the  sum  of  the  modal  potential  and  kinetic 
energies. 


Therefore, 


Ei  oc  +  X,* 


(5.6) 


(5.7) 


Rather  tham  penalizing  all  of  the  states  equally,  as  in  (5.1),  it  seems  reasonable  to 
penalize  the  total  energy  in  the  system.  Then  the  controls  may  be  penalized  as  before. 
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or  individually,  to  bal2ince  the  tradeoff  between  modal  energy  and  control  effort.  K  we 
order  the  displacements  and  velocities  into  a  state  vector,  x  =  i^]^,  this  b 

accomplished  by  the  cost  matrices, 

fi*  0  1 

^1=  Q  J  ,  R2=^^^&{pr^,Pr^,-PrJ,  Ru  =  0  (5.8) 

for  a  system  with  m  controls,  where  fl  =  diag{a>i,a;2,...,u;„}.  For  the  system  imder 
study  there  is  only  one  control  input,  so  this  technique  leaves  only  one  parameter  to 
adjust.  In  section  5.2  it  will  be  shown  how  the  cross-weighting  matrix,  R12,  may  be 
m2inipulated  to  suit  other  design  goals. 

The  covariance  matrices  can  also  be  given  a  more  sensible  structure,  although 
the  modifications  involved  may  not  be  significant  enough  to  gain  much  advantage.  For 
controller  design  purposes  the  state  and  output  equations  for  the  plant  are  expressed  in 
modal  coordinates: 

j  =  +  Bu  +  GiW  =  Ax +  Bu  +  [pji  <712] 

y  =  Cx  + Du  +  G2W  =  Cx  +  Du  +  [sf2i  922] 

It  is  assumed  here  that  the  process  and  sensor  noise  are  independent  of  each  other. 
Hence,  we  have  partitioned  the  exogenous  noise  vector  w  into  process  noise  v  and  sensor 
noise  n.  Since  sensor  noise  would  only  occur  in  the  output  equation,  gi2  =  0.  Also,  w  is 
assumed  normalized  by  Gi  and  G2  such  that  =  I. 

One  might  arbitrarily  assume  that  the  elements  of  n  each  affect  one  modal 
output  in  y  and  to  the  s^une  degree,  so  that  ^22  =  Pv^^^-  However,  any  judgment  about 
the  relative  intensities  and  distribution  of  the  sensor  noise  should  be  made  in  sensor 
coordinates.  Denote  the  sensor  measurements  as  a  sum  of  a  true  signal  and  sensor 
noise,  as  follows: 

=  y.  +  (S-io) 

The  relationship  between  the  sensor  outputs  and  the  modal  outputs  is  given  by 

=  %  (511) 
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where  the  sensor  eigenvector  (or  modeshape)  matrix  $  has  at  least  as  many  rows  as 
columns  and  indicates  the  relative  participation  each  modal  output  has  on  each  sensor. 
The  output  equation  in  modal  coordinates  is  approximated  by  the  least  squeaes 
solution, 


y  =  +  ^*g,n,  (5.12) 

where  is  the  least  squares  pseudoinverse  of  or  simply  the  inverse  if  $  is  square. 
Comparison  of  (5.12)  with  the  output  equation  in  (5.9)  reveals  that  simply  setting 
922  —  ignores  the  structure  of  the  modeshape  matrix.  Instead,  for  E{n,n,}  ^  J  we 

might  reasonably  set  g^  =  if  all  of  the  sensors  are  identical  and  function 

independently  of  one  another.  That  gives  us 

922«  =  (5.13) 

The  original  sensor  noise  vector  n  should  be  replaw:ed  by  the  possibly  higher  dimensional 
vector  giving 


922  =  (5.14a) 

but  that  does  not  affect  the  dimension  of  —  52i92i^  +  922922^ •  Therefore, 

^2  =  92192/  +  =  92\921^  +  Pvi^'^^)  ~ '  (5.15) 

Note  that  the  S2une  result  for  V2  may  be  arrived  at  by  replacing  (5.14a)  with 

=  (5.14b) 

This  alteration  has  no  effect  on  V],  =  either  because,  as  was  already  indicated, 

gi2  =  0-  Using  (5.14b)  allows  us  to  retain  the  lower  dimensional  sensor  noise  vector  n. 
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5.2  Disturbance  Cancellation  and  Inverse  Optimal  Control 

When  the  process  noise  disturbance  is  expected  to  lie  in  some  limited  frequency 
range,  it  is  a  highly  inefficient  allocation  of  control  effort  to  attempt  to  reject  that 
disturbance  by  merely  lowering  the  control  penalty,  p,.,  and  sensor  noise  covariance,  p„, 
until  the  desired  degree  of  rejection  is  achieved.  That  procedure  results  in  greater 
disturbeince  rejection  at  all  frequencies  and  potentially  requires  an  unacceptable  amoimt 
of  control  effort  (as  well  as  excessive  sensitivity  to  sensor  noise)  to  achieve  the  specified 
level  of  rejection  in  the  anticipated  disturbance  frequency  band.  Rather  than  raise 
controller  authority  at  all  frequencies,  it  is  more  desirable  to  concentrate  the  control 
effort  where  it  is  needed. 

Suppose  we  have  the  following  plant  and  colored  disturbance  model,  respectively: 


ip  =  Axp  +  Bu  +  G^w 

(5.16) 

y  =  CXp  ADu-\-  g^yW  +  Sf22« 

(5.17) 

W  =  C^X^ 

(5.18) 


(5.19) 


Augmenting  the  plant  with  the  shaping  filter  dynamics  yields 


If  we  assume  complete  knowledge  of  the  augmented  state,  the  feedback 

u=  -Fx^  ~[F^  FJ 

[Xu, 

produces  the  closed-loop  system  and  output  matrices. 
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(5.20) 


.  -4  —  BF  p  —  BF^ 

^el  = 

0 

C'c/  =  ^  C  —  DF p  92\G^  —  DF^ 

If  g^x  is  nonzero,  there  are  two  components  of  the  disturbance  —  one  which  enters 
through  and  influences  the  states,  and  the  other  which  enters  by  way  of  ^21  and 
feeds  through  to  the  outputs.  Both  components  are  completely  canceled  if  F^  is  chosen 
such  that 


G^C^-BF^  =  {i  (5.21a) 

g^xC^-DF^^Q  (5.21b) 


Both  conditions  cannot  in  general  be  met  simultaneously.  However,  the  type  of 
problem  under  study  has  a  favorable  structure  in  this  regcird.  For  the  n„-mode  flexible 
structure  model  studied  in  chapters  6  and  7,  the  control  input  and  disturbance  are 
forces  and  affect  only  the  derivatives  of  the  velocity  states.  Also,  the  measurements  axe 
modal  accelerations.  Therefore,  the  matrices  5,  ,  D,  and  ^21  take  the  form. 


B  = 


(5.22) 


D  =  h,  g2x=  9 


and  both  (5.21a)  and  (5.21b)  reduce  to: 


bF^  =  gC^ 


(5.23) 


A  solution,  F^,  that  satisfies  (5.23)  cannot  be  foimd  in  general  unless  the  n„  x  m  matrix 
b  hcis  rank  n^.  This  condition  requires  that  the  number  of  actuators  be  at  least  as  great 
as  the  number  of  modes  in  the  plmt  model  (i.e.,  m  >  n^).  If  m  <  it  may  not  be 
possible  to  completely  cancel  the  disturbance,  but  we  cm  still  choose  F^  to  cancel  the 
effect  of  the  disturbance  on  m  of  the  modes  by  selecting  the  corresponding  m  rows  of  b 
and  g  to  replace  the  full  matrices  in  (5.23). 
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If  the  state  feedback  (5.19)  must  pass  through  a  low  pass  smoothing  filter  before 
entering  the  plant  through  B,  the  above  technique  may  still  be  applied,  but  it  is  no 
longer  possible  to  cancel  the  disturbance  at  all  frequencies  simult2ineously.  Define  the 
smoothing  filter  transfer  function,  ifj(s)  =  Cj(s/ —  Then  the  condition  on 

which  produces  exact  cancellation  of  the  disturbance  at  frequency  Uj  is: 

bCiijujJ  -  At)  -  ^BtF^xJjuj)  =  gC^x^Uuj)  (5.24) 

The  vector  x^{ju)j)  is  found  by  applying  a  sinusoid  of  frequency  uj  to  the  input  of  the 
disturbance  shaping  filter  (5.17)  and  measuring  the  response  of  x^  in  the  frequency 
domain.  Assuming  u  is  a  scalar,  as  is  the  case  in  our  single  disturbance  input  model,  we 
set 

v{t)  =  sin(u;^t)  (5.25) 

Then,  after  talking  the  Fourier  tr«insform  of  (5.17)  and  solving  for  x^  we  have 

x{ji^d)  =  -  AJ  -  ^B^v{jua)  (5-26) 


Substituting  (5.26)  into  (5.24)  and  eliminating  the  scalar  v{ju>j)  from  both  sides  of  the 
equation,  we  arrive  at: 


6C|(ju>j/ -  A,) 


(5.27) 


When  estimator-based  feedback  is  necessary,  infinite  disturbance  rejection  is  no 
longer  possible,  but  the  disturbance  cancellation  technique  may  still  be  used  to  provide 
good  rejection  if  the  estimates  of  the  disturbance  states  are  accurate.  Therefore,  the 
effectiveness  of  this  technique  depends  on  the  accuracy  of  our  knowledge  of  the  plant 
(particularly  at  the  disturbance  frequency,  where  the  control  effort  is  concentrated)  and 
on  the  speed  of  the  estimator. 

Conceptuadly,  disturbance  c<incellation  depends  on  the  separability  of  the 
estimator  and  regulator  designs.  The  method  just  discussed  assumes  that  the  estimates 
of  the  disturbance  states  are  available  for  feedback  so  that  the  corresponding  regulator 
gains  can  be  designed  to  cancel  the  disturbance.  This  restriction  presents  a  problem 
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when  the  plant  model  involves  multiplicative  white  noise  (as  with  the  time-domain 
technique  of  Chapter  3)  or  when  an  optimal  reduced  order  controller  (see  Chapter  4)  is 
desired.  In  both  cases,  the  separation  principle  of  LQG  does  not  hold,  beca\ise  the 
controller  solution  involves  coupled  Riccati-  and  Lyapunov-type  equations  [see  (4.16)- 
(4.19)].  As  will  be  discussed  in  section  5.5,  however,  its  application  is  most  desirable 
under  precisely  these  conditions. 

Fortunately,  disturbance  cancellation  can  be  adapted  to  problems  involving  a 
coupled  regulator  and  estimator  by  appljring  an  idea  based  on  inverse  optimal  control. 
The  inverse  problem  of  optimal  control  for  linear  state  feedback  problems  was  first 
investigated  by  Kalman  (1964).  Given  a  linecir  state  feedback  law,  u  =  —  Fx,  the 
inverse  problem  is  concerned  with  finding  all  cost  functionals, 

J  =  5  /  [x(i)"^i2ia;(<)  -I-  2x{t)'^ Ri2u{t)  -|-  u(/)’^i22“(*)]  (5.28) 

for  which  the  control  law  is  optimal.  That  is,  given  F,  find  all  corresponding  matrix 
triples  Returning  to  the  disturbance  cancellation  problem,  F  is  considered 

known,  because  we  can  set  Fp  =  0  and  solve  (5.23)  or  (5.27)  for  assuming  for  the 
moment  that  we  have  a  standard  LQG  problem.  By  applying  inverse  optimal  control, 
we  can  then  solve  for  cost  matrices  —  iZi,  iJi2,  and  R2  —  which  provide  us  with  a 
regulator  to  cancel  the  anticipated  disturbance.  These  cost  matrices  may  then  be  used 
to  design  a  controller  with  coupled  regulator  and  estimator  for  problems  involving 
multiplicative  white  noise  or  optimal  reduced-order  control. 

Kalman  resolved  the  inverse  optimal  control  problem  for  single-input  systems, 
with  the  restriction:  R12  =  0.  He  showed  (Kalman  1964,  Theorem  6)  that  a  solution, 
{i2i,il2}>  exists  if  2uid  only  if  a  certain  condition  on  the  return  difference  function  — 
now  known  as  the  Kalman  inequality  —  holds.  The  restriction  on  R12  is  not  a  concern 
for  many  problems,  because  any  crossweighting  may  effectively  be  eliminated  by  a 
suitable  modification  of  the  system  matrix  and  the  addition  of  aurtificial  state  feedback 
to  compensate  for  that  modification.  In  estimator-based  feedback  systems  where  the 
separation  principle  does  not  hold,  however,  that  method  of  formulating  an  equivcdent 
problem  is  no  longer  valid. 

Kreindler  and  Jameson  (1972)  showed  that  if  a  nonzero  crossweighting  is 
cillowed,  the  inverse  problem  edways  has  a  solution.  Since  they  were  concerned  with  the 
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conventional  problem  of  finding  all  solutions,  tbeir  approaches  are  so 

general  (e.g.,  allowing  Riccati  solutions  not  to  be  positive  definite)  that  they  make  it 
difficult  to  find  a  single  solution. 

When  only  a  single  solution  is  sought,  as  is  the  case  here,  the  inverse  optimcd 
control  problem  is  much  simpler.  Let  I  and  assume  that  the  state  feedback  law 
which  provides  complete  disturbance  cancellations  given  by:  u  =  —  Fx.  Then  the 
integrand  of  the  cost  functional  (5.28)  can  be  rewritten  as  follows: 

4-  2i^i2i2tx  +  (5.29) 

=  X^(Ri  -  i2j2i22  ~  +  (u  +  il2  “  ^i2i2^x)^i22(u  +  ^2  “  ^^12^®) 

=  x’^iR,  -  RnRx2^)x  +  x\R^;^  -  F)'^{Rn'^  -  F)x 

Since  a  negative  cost  is  physically  impossible,  we  must  assume  that 

Ri  -  R12R2  ~  ^Rn'^  =  i2,  -  i2i2-Ri2'^  >  0  (5-30) 

where  “  >  0”  denotes  positive  semidefiniteness.  Therefore,  a  set  of  cost  matrices  for 
which  F  minimizes  the  cost  functional  (in  fact  makes  it  zero)  is: 


{R,,R,2,R2}  =  {F'^F,F'^,I} 


(5.31) 


This  means  that  in  order  to  apply  the  disturbance  cancellation  technique  to 
multiplicative  white  noise  or  optimal  reduced-order  control  problems,  we  can  use  the 
following  algorithm: 


(1)  Solve  (5.23)  or  (5.27)  for  F^ 

(2)  Arbitrarily  set  F^  =  0,  so  that  F  =  [0  F^] 

(3)  Compute  the  desired  cost  matrices  according  to  (5.31) 

(4)  Design  the  controller  using  the  cost  matrices  from  step  (3) 


As  in  the  case  of  LQG  problems,  the  effectiveness  of  this  algorithm  depends,  in  a  sense, 
on  a  good  estimator,  since  accurate  knowledge  of  the  states  was  assumed.  Even  though 
there  is  no  explicit  estimator  in  these  modified  LQG  controllers,  the  algorithm  stated 
above  hais  worked  well  consistently  for  “large”  (corresponding  to  a  fast  estimator). 
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lu  the  event  that  some  transient  suppr^sion  is  desired  in  addition  to  good  steady 
disturbance  rejection,  the  2dgorithm  may  be  modified  by  abiding  a  penalty  to  the  plant 
states.  The  above  algorithm  yields 


Ri  = 


0  0 
0 


(5.32) 


The  desired  transient  suppression  is  accomplished  by  replacing  the  zero  submatrix  of 
the  (1,1)  block  with  a  sufficiently  “large”  positive  semidefinite  matrix.  That  matrix 
may  be  chosen  by  penalizing  modal  energy  as  in  section  5.1  (i.e.,  by  choosing  some 
multiple  of  from  (5.8)  to  fill  the  (1,1)  block).  Experience  has  shown  this 
modification  to  the  algorithm  to  be  very  effective. 


5.3  Discrete-Time  Controller  Design 

Continuous-time  controllers  tend  to  be  more  convenient  for  theoretical  work  and 
for  frequency-domain  analysis,  but  implementation  of  a  control  law  on  a  digital 
controller  requires  the  design  of  a  discrete-time  controller.  Ideally,  the  discrete-time 
controller  is  directly  designed  from  sampled-data  model  of  the  control  system.  Since 
discrete-time  design  algorithms  are  readily  available  for  the  standard  LQG  problem,  this 
direct  design  approach  was  used  to  implement  controllers  on  the  hardware  (see  Chapter 
7)  when  neither  multiplicative  white  noise  nor  optimal  projection  was  involved.  The 
development  of  the  sampled-data  version  of  those  modified  LQG  problems,  as  well  as 
the  algorithms  necessary  to  solve  for  their  respective  compensators,  is  beyond  the  scope 
of  this  work  emd  arguably  could  be  considered  a  duplication  of  effort.  A  good 
approximation  to  the  directly  designed  discrete-time  controller  can  be  obtained  by 
discretizing  the  corresponding  continuous-time  controller.  Of  course,  the  approximation 
is  pcirticularly  good  when  the  sample  rate  is  high.  Therefore,  it  is  fortunate  that 
optimal  reduced-order  controllers  tend  to  mcdce  higher  sample  rates  possible. 

The  remainder  of  this  section  discusses  two  considerations  which  should  be  taken 
into  account  in  designing  a  discrete-time  controller. 

A.  Conversion  of  Cost  and  Covariance  Matrices  to  Discrete-Time 

In  sampled-data  controller  design,  the  continuous-time  model  of  an  actual 
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continuous-time  plant  is  translated  into  a  discrete-time  equivalent,  and  there  is 
relatively  little  intuitive  understanding  for  the  meaning  of  the  resulting  discrete-time 
vMiables.  Therefore,  rather  than  designing  the  compensator  entirely  in  discrete-time,  it 
mcikes  more  sense  to  specify  the  cost  functional  and  noise  covariances  in  continuous¬ 
time  and  then  translated  them  to  discrete-time  for  computing  the  optimal  discrete-time 
controller.  Assume  the  continuous-time  plant  model, 

x{t)  =  Ax{t)  +  Bu{t)  +  toi(t)  (5.33) 

y{t)  =  Cx{t)  -I-  Du{t)  -I-  W2{t) 

where  the  noise  covariances  are  given  by: 

e|  Kno  =  J't 

[w)J  j  [^12  V2 

And  define  the  zero-order  hold  equivalent  of  the  plant: 

Xk  +  1  =  +  r(A<)ufc  -f  u;,^  (5.35) 

yk  =  Cxk  +  Du,,  +  W2^ 

where  At  is  the  sampling  interval,  the  subscript  denotes  the  sampling  instant  for 
A:  =  0,1,2,...  (i.e.,  —  ^(^fc)i  where  the  A:***  sampling  instant  occurs  at  time  t^),  and 

$(At)  =  e^’^*  (5.36) 

T{At)  =  J^t)dt’B 
0 

tWi  =  /  ‘*  +  ^  m  -  tk)wi{t)dt  (5.37) 

*  ‘fe 

W2^  =  W2(tk) 

Then  the  discrete-time  equivalent  to  the  continuous-time  cost  ftmctional,  (5.28), 
takes  the  form, 

J  =  ^  (5-38) 
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where  the  discrete-time  cost  matrices  are  denoted  with  carats  and  are  given  by  (Stengel 
1986,  pp.  276-7): 


0 

Bi2=7V(*)lfl.r(t)+«ul<i< 

0 

R2  =  /(r‘^(t)i2ir(<)  -I-  r'^(<)i2,2  +  +  R2]dt 

0 


(5.39) 


The  discrete-time  equivalent  covariance  matrices  are  computed  by  their 
definitions,  using  (5.34)  and  (5.37)  as  follows: 


V'. 


4  E 

=  E 1^/'*  ♦ '  *(<  -  t,)w,m  •/'*  +  >  u./(()*T(,  -  (,)*j 

^  j*k  +  ij*k  +  i 
•fc  *k 

=  /  ‘*  +  ^  $(<  -  <t)  /  **  +  >  ViS(t  -  r)$T(r  -  t,,)dTdt 
‘k  *k 


(5.40) 


f  mVi^'^{t)dt 
0 


=  E 


/  *  +  *  ^t-tk)Wi{t)dt-W2^{tk) 


=  f^k  +  l  $(<  _  <^)E[u,j(f)u;/(<t)](ii 
*k 


■» 


(5.41) 
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V,  A  E  V] 

=  E  [W2{tk)^2'^itk)] 


(5.42) 


=  ^2^(0)  ??? 

From  (5.42)  we  see  that  the  literal  translation  to  discrete-time  resiilts  in  an  infinite 
sensor  noise  covariance.  If  (5.42)  were  used  for  the  controller  design  computations,  the 
Kadmem  filter  would  degenerate  to  an  open  circuit.  The  problem  airises  from  the  fact 
that  the  ^-correlated,  infinite-power  white  noise  model  assumed  by  LQG  theory  does 
not  reflect  reality.  Intuitively,  it  seems  desirable  to  retaiin  the  structure  of  the 
covairiance  derived  in  (5.42),  while  chainging  the  scalar  multiplier  to  a  finite  number. 
Gelb  (1984,  p.l21)  argues  in  favor  of  using 

(5.43) 

because  the  resulting  discrete- time  sensor  noise  covariance  approaches  the  continuous¬ 
time  covariance  as  At-*0.  That  was  the  method  used  for  this  study,  although  the  actual 
scalar  multiplier  used  is  not  important,  since  it  tends  to  become  a  design  parameter  in 
practice. 

The  matrix  integrals  used  to  compute  the  discrete-time  cost  and  covariance 
matrices  are  easily  evaluated  numerically  —  for  example,  by  means  of  the  forward 
rectangular  rule.  Only  20  to  30  intervals  were  required  for  good  accmacy  in  the 
examples  of  this  study,  but  fewer  eu’e  necessary  for  smaller  At. 


B.  Computational  Delay  \ 

Standard  LQG  compensators  (i.e.,  those  with  a  Kalman  filter  sepcirate  from  the 
regulator)  are  easily  modified  to  overcome  the  computational  delay  inherent  in  discrete¬ 
time  controllers.  For  a  one-s2unpling-interval  time  delay,  the  Kadman  filter 
extrapolation  and  update  equations  need  merely  to  be  reordered  to  turn  the  Kalman 
filter  into  a  one-step- ahead  predictor.  The  predicted  state  estimates  are  then  fed  back, 
by  means  of  the  regulator,  so  that  the  proper  state  estimates  reach  the  actuators  at  the 
proper  time. 
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In  problems  involving  multiplicative  white  noise  and/or  optimal  projection,  the 
lack  of  an  explicit  estimator  makes  that  procedure  impossible.  If  the  continuous- time 
controller  is  simply  discretized,  there  will  be  a  one-intervad  time  lag  between  the  time  of 
the  intended  controller  command  and  the  instant  that  command  reaches  the  actuators. 
For  faster  sample  rates,  the  problem  is  smaller  and  may  not  be  significant.  In  case  the 
computational  delay  is  significant,  it  should  be  modeled  in  the  control  system. 

A  pure  one-interval  delay  is  precisely  modeled  by  inserting  the  irrational  transfer 
function 

(5.44) 


between  the  commanded  control  signal  leaving  the  controller  and  the  actual  control 
input  entering  the  plant.  However,  since  is  infinite-dimensional,  a  suitable  finite¬ 

dimensional  approximation  must  be  found  —  preferably  of  as  low  order  as  possible, 
because  additional  dynamics  tend  to  complicate  controller  design  computations  and 
increase  controller  order.  Fade  approximations  to  e~* ‘ are  useful  in  this  context.  A 
Fade  approximation  tends  to  be  more  accurate  than  the  Taylor  series  approximation  of 
the  same  order,  and  it  is  easily  computed  (Bender  and  Orszag  1978,  secs.  8.3  and  8.4). 
As  an  example,  the  second-order  Fade  approximation  to  (5.44)  is  given  here: 


H  (b)  —  ^  •  At  •  At)^ 


(5.45) 


s^--S-s-j--12. 


5.4  Iterative  Relaxation  Algorithm  for  Solving  Coupled  Riccati/Lyapunov  ESquations 

As  mentioned  in  section  4.1,  problems  involving  multiplicative  white  noise  or 
optimad  controller  order  reduction  require  an  iterative  method  to  find  a  solution  to  the 
coupled  extremal  equations.  A  number  of  different  algorithms  are  described  in  the 
literature,  but  they  all  belong  to  one  of  two  categories  (or  are  a  hybrid  of  the  two)  — 
iterative  relaxation  methods  and  continuation  (or  homotopy)  methods.  The  idea  behind 
the  iterative  relaxation  approach  is  to  alternately  solve  the  coupled  equations  for  one 
unknown  at  a  time  while  treating  the  other  unknowns  as  constants.  The  main 


advantage  of  this  method  of  solution  is  that  it  requires  no  prior  analytical  development, 
assuming  standard  Riccati-  and  Lyapunov-equation  solvers  are  2ilready  available,  and  it 
is  therefore  easily  and  quickly  implemented.  The  main  disadvantage  is  that  this  type  of 
algorithm  is  not  guaranteed  to  converge  to  a  solution. 

The  iterative  relaxation  algorithm  detailed  in  this  section  is  a  variation  on  one 
described  by  Bernstein  and  Hyland  (1988b,  pp.  290-91).  The  algorithm  contains  two 
nested  loops  and  is  designed  to  handle  the  general  reduced-order  controller  problem  in 
the  presence  of  simultaneous  state-,  control-,  and  measurement-dependent  noise.  More 
specifically,  it  attempts  to  solve  the  general  optimal  projection  equations,  (4.16)-(4.20), 
for  Q,  P,  Q,  P,  and  r  ^  and  subsequently  to  find  the  controller  state-space  matrices  — 
Ac,  J?c,  and  —  using  (4.15).  If  a  full-order  controller  is  desired,  the  algorithm  is 
easily  modified  by  removing  the  outer  loop. 

Define  the  absolute  norm  of  an  m  x  n  matrix  M  as  follows: 

IIM 11^  =  max(A/,^,  t  =  l,2,...,m;  j  =  l,2,..,,n)  (5.46) 

The  iterative  relaxation  algorithm  used  in  this  study  will  now  be  stated: 

(1)  If  the  Stratonovich  multiplicative  noise  interpretation  is  desired,  modify  A,  B, 

and  C  using  the  substitutions  prescribed  in  (3.73). 

(2)  Perform  initializations: 

(a)  Let  7,-  =  0  (i  =  l,2,...,p)  and  r  =  I„  (i.e.,  r  =0). 

(b)  Solve  Riccati  equations  (4.16)  and  (4.17)  for  initial  values  of  Q  and  P. 

(c)  Solve  Lyapunov  equations  (4.18)  ^lnd  (4.19)  for  initial  Q  and  P. 

(Note:  this  initialization  corresponds  to  the  standard  LQG  solution.) 
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Beginning  of  outer  loop: 

(3)  Update  the  optimal  projection  matrix,  t: 

(a)  Compute  a  balancing  transformation,  such  that: 

=  =  A  (5.47) 

A^diag{Ai,A2,...,A„} 

(Laub  1980).  Note:  all  A,  (t  =  l,2,...,n)  are  nonnegative  real. 

(b)  Perform  a  basis  rearrangement: 

Rearrange  the  Aj  so  that  Aj  >  Aj  > . . .  >  A„  ,  then  rearrange  the 
corresponding  n  columns  of  $  using  that  same  ordering. 


(c)  Compute 


where 


(5.48) 

(5.49) 


(4)  Test  for  convergence  of  Q,  P,  Q,  P,  and  t  (condition  to  leave  outer  loop): 

(a)  Compute  relative  errors,  as  follows. 

ea4||r.h.s.of(4.I6)l,/||V'.|U  (5.50)- 

ep4||r.h.s.of(4.17)pJJ,|t, 

eg  ^||r.h.s.  of  (4.18)y  OVr'Q’'ll. 

ep  ^|r.b.s.  of  (4.19)y 

(b)  If  max{eQ,ep,e-,ep}  <  ej  (the  outer  loop  tolerance),  go  to  (14). 

Otherwise,  go  to  (5). 

Beginning  of  inner  loop: 

(5)  Update  ¥2,  R21  ^  using  equations  (3.53),  (3.58),  (3.63),  and  (3.64). 
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(6)  Update  Q: 

Solve  Riccati  equation  (4.16)  for  Q  by  treating  the  terms  involving  7^  and 
as  constants,  effectively  adding  them  to  Uj.  The  7^  term  in  the 
definition  of  Q  should  be  added  to  Vjj,  effectively  defining  a  modified 
cross-covariance. 

(7)  Repeat  (5). 

(8)  Update  P: 

Solve  Riccati  equation  (4.17)  for  P  by  treating  the  terms  involving  7,-  and 
as  constants,  effectively  adding  them  to  Rj,  with  one  exception.  The 
7i  term  in  the  definition  of  shotdd  be  added  to  Ri2i  effectively  defining 
a  modified  cross-weighting. 

(9)  Repeat  (5)  and  update  Ap  using  equation  (3.65). 

(10)  Update  Q: 

Solve  Lyapunov  equation  (4.18)  for  Q  by  treating  the  term  involving  t  ^ 
as  a  constant,  effectively  adding  it  to  the  other  constant  term. 

(11)  Repeat  (5)  and  update  Aq  using  equation  (3.65). 

(12)  Update  P: 

Solve  Lyapunov  equation  (4.19)  for  P  by  treating  the  term  involving  t 
as  a  constant,  effectively  adding  it  to  the  other  constant  term. 

(13)  Test  for  convergence  of  Q,  P,  Q,  and  P  (condition  to  leave  inner  loop): 

(a)  Compute  maximum  relative  change  since  the  last  iteration,  as  follows. 

Ao^le'-Q'-’L/llo'L  (5.51) 

where  the  superscripts  indicate  the  iteration  niunber  of  the  inner  loop  — 
i  being  the  current  iteration. 
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(b)  If  max{Q,P,Q,P}  <  C2  (the  inner  loop  tolerance),  go  to  (3). 
Otherwise,  go  to  (5). 

(14)  Compute  G  and  F  using  equations  (4.14). 

(15)  Compute  Ag,  and  using  equations  (4.15). 


We  should  recall  that  the  optimal  projection  equations  ase  necessary  conditions 
for  the  solution  of  optimal  reduced-order  controllers.  According  to  Hyland  and 
Bernstein  (1988b,  p.  292),  for  an  n*^-order  plant  with  m  control  inputs  and  I  sensor 
outputs,  there  is  only  one  solution  if  >  min(n,  m,  /),  but  otherwise  there  may  be  as 
many  as 

(min(n,  m,  /)\  ^  [min(n,  m,  /)]! 

J  ~  [min(n,  m,  /)  -  n^]!  •  nj 

solutions.  The  choice  of  initial  values  for  Q,  P,  Q,  and  P  corresponding  to  the  LQG 
solution  in  step  (2)  is  an  attempt  to  begin  the  iterations  as  close  as  possible  to  the 
optimal  solution  sought,  therefore  minimizing  the  risk  of  convergence  to  a  suboptimad 
solution  and  reducing  computation  time.  However,  for  the  problems  treated  in  this 
study,  m  =  1.  That  allowed  the  controller  order  to  be  specified  as  low  as  needed 
without  concern  for  the  existence  of  multiple  solutions  to  the  optimal  projection 
equations. 

The  most  difficult  parts  of  the  algorithm  to  establish  are  the  convergence  tests 
and  tolerance  specifications.  These  are  important,  since  a  test  which  is  too  stringent 
will  never  be  satisfied,  and  a  test  which  is  too  relaxed  will  not  allow  the  algorithm  to 
converge.  In  either  case,  the  algorithm  becomes  an  infinite  loop.  Unfortimately,  the 
tolerances  must  be  adjusted  to  suit  the  problem  at  h2Uid,  because  the  values  of  the  most 
effective  tolerances  are  somewhat  sensitive  to  the  pliuit  model  and  even  to  controller 
authority.  Although  both  tests  are  in  some  sense  normalized,  they  are  not  universally 
applicable  with  the  same  toleramces.'  When  the  algorithm  is  repeatedly  applied  to  the 
saune  plant  model,  acceptable  tolerances  can  be  chosen  after  some  experimentation. 
Otherwise,  it  is  advisable  to  do  one  or  more  of  the  following:  (1)  monitor  more  than  one 
indicator,  (2)  make  the  tolerances  automatically  adaptive,  or  (3)  give  the  software  the 
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capability  to  accept  lUcinual  changes  in  the  tolerances  between  iterations. 

The  convergence  test  used  for  the  inner  loop  —  step  (12)  —  was  chosen  because 
it  is  very  cheap  computationally  to  monitor  changes  in  values  since  the  l<ist  iteration. 
This  type  of  test  has  also  proven  to  be  quite  effective  in  determining  convergence,  even 
when  compared  to  more  sophisticated  tests.  The  outer  loop  convergence  test  —  step  (4) 
—  was  suggested  by  Richter  and  Collins  (1989).  It  seemed  to  be  the  logical  choice  for 
the  outer  loop  (or  for  the  only  loop  if  both  <ire  not  present),  because  it  monitors  the 
equation  errors  directly,  although  it  requires  much  more  matrix  arithmetic.  Still 
another  convergence  test  for  the  outer  loop  was  suggested  by  Hyland  and  Bernstein 
(1988b,  p.  291): 


for  some  tolerance  e.  This  test  is  based  on  the  property  —  trace(T)  =  —  when 

complete  convergence  has  been  attained.  To  see  why  this  property  holds,  recall 

equations  (4.13)  and  (4.14),  and  note  that 

r  =  G'^r,  (5.53) 

Then 

trace(r)  =  trace(G^r)  =  trace(rG^)  =  (5.54) 


trace(rj 


Step  (5)  —  the  update  of  matrix  values  which  appe«ir  in  the  Riccati-  and 
Lyapunov-type  equations  —  is  repeated  in  several  places  throughout  the  adgorithm  in 
order  to  give  those  equations  the  most  recent  information.  It  may  be  omitted  in  some 
places  in  order  to  save  a  little  computation  time,  but  the  2dgorithm  may  diverge  if  it  is 
not  repeated  frequently  enough. 


5.5  Homotopy  Algorithm  for  Solving  Optimal  Projection  Elquations 

Several  homotopy  adgorithms  have  been  developed  to  solve  the  optimal 
projection  equations,  but  as  of  this  writing  they  have  only  been  hinted  at  (e.g.,  Richter 
1987;  Richter  and  Collins  1989)  and  have  not  been  published  in  explicit  form  vx  archival 
journals.  For  that  reeison,  a  homotopy  algorithm  is  developed  here,  although  with  the 


67 


restriction  that  it  applies  to  optimal  projection  only  (i.e.,  multiplicative  white  noise  is 
not  included).  A  similar  algorithm  for  the  more  general  case  may  be  derived  based  on 
the  principles  given  in  this  section,  although  it  would  likely  be  much  more  complicated. 
The  presence  of  multiplicative  white  noise  adds  severed  new  terms  to  the  Riccati  and 
Lyapunov  equations,  resulting  in  the  appearance  of  all  five  unknowns  (Q,  P,  Q,  P,  and 
t)  in  all  four  equations  when  state-,  control-,  and  measurement-dependent  noise  aire 
present. 

Examination  of  the  optimal  projection  equations  (4.7)-(4.11)  reveals  that  the 
degree  of  coupling  among  the  Riccati-  and  Lyapunov-type  equations  is  directly  related 
the  “size”  of  the  matrices  Vj  ^nd  Pj.  When  the  sensor  noise  covariances  and  control 
penalties  are  large,  the  coupling  is  reduced.  However,  when  lairge  controller  authority 
(i.e.,  small  md  Pj)  is  desired,  the  interaction  among  these  equations  tends  to  be 
great,  and  the  iterative  algorithm  of  section  5.4  is  less  likely  to  converge  to  a  solution. 
This  problem  frequently  occurred  when  the  disturbance  cancellation  method  of  section 
5.2  was  not  used  and  the  resulting  controller  authority  needed  to  meet  disturbance 
rejection  specifications  was  necessarily  high.  Rather  than  alternately  solving  for  Q,  P, 
Q,  and  P,  repeatedly,  the  homotopy  algorithm  replaces  this  inner  loop  iteration  with 
numerical  integration  and  finds  these  solutions  (for  fixed  t)  in  a  single  peiss.  It  does  so 
by  continuously  deforming  the  Riccati  equation  solutions  from  a  known  solution  (e.g., 
the  LQG  solution)  into  the  solution  sought. 

Homotopy  methods  of  solving  complex  equations,  such  as  the  one  described  here, 
are  developed  in  three  steps;  (1)  Find  the  solution  to  a  simple,  but  related  equation; 
(2)  Express  the  relationship  between  the  solutions  to  the  simple  and  complex  equations 
in  terms  of  a  differentied  equation;  and  (3)  (Numerically)  integrate  the  differential 
equation  to  obtain  the  solution  to  the  complex  equation.  Given  a  function  P:R"-»R”, 
suppose  we  seek  the  solution,  u,  to  F(u)  =  0.  Define  a  function  x  [0, 1]-*R”  such 
that: 


(a)  H{u{q),  q)  =  0  for  a  e  [0, 1]  (5.55) 

(b)  P(u,l)  =  P(u) 

(c)  a  solution  u(0)  to  P(u(0),0)  =  0  is  known 

Then  if  H  is  iitinuous  and  [dH ldu)~^  exists  over  the  entire  interval  a  €[0,1],  the 
solution  u(l)  to  P(u(l),  1)  =  P(u)  =  0  may  be  found  as  follows.  First,  differentiate 
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H{u{a),a)  =  0  with  respect  to  a  using  the  chain  rule  for  vectors  (Graham  1981,  sec. 
4.3): 


dH{u{a),Q)  du{a)  dH{u{a),a)  dH{u{a),a)  _ 


(5.56) 


Then 


du{a) 

da 


{u{a\a)  (dH{u{a),a)\ 
da  ^  du(a)  j 


(5.57) 


Since  u(0)  is  known,  we  can  treat  (5.57)  as  an  initial  value  problem,  integrating 
du{a)lda  over  the  intervcil  from  0  to  1  to  obtain  the  solution  u  =  u(l).  For  a  more 
complete  discussion  of  homotopy  methods,  see  Richter  and  DeCarlo  (1983). 

Before  applying  the  homotopy  principle  to  the  modified  Riccati  equations  (4.7)- 
(4.8),  it  is  advisable  to  rewrite  these  equations  in  vector  form  in  order  to  avoid  the 
problem  of  taking  derivatives  of  a  matrix  with  respect  to  a  matrix.  Recall  equation 
(4.7): 

0  =  AQ  +  QA^-  OVj  ■  +  Vi  +  t  X  QVj '  ^  (4.7) 

where  Q  =  QC^  +  ¥^2,  and  assume  t  is  fixed.  Then  the  equivalent  equation  in  Q  that 
we  wish  to  solve  is: 


FqIvccQ,  vecQ”)  =  {A  ^  A)  vecQ  +  vec  Vj  —  vec(QV’2  +  r*vec((IlV2 


(5.58) 


where 


r  =  r  j_ 


(5.59) 


If  the  last  term  in  (5.58)  [or  (4.7)]  were  not  present,  we  would  have  a  standard  Riccati 
equation,  whose  solution  is  easily  found.  Therefore,  a  logical  choice  for  the  function  H 
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HQ(vecQ,vecQ’^,a)  =  (.4  ®  A)  vecQ  +  vec  Vj  —  vec(QK2  ”  *Q^)  +  aT*vec(QV'2  ~ 

=  0  (5.60) 

For  a  =  0,  (5.60)  is  equivalent  to  the  standard  Riccati  equation,  and  for  q  =  1,  we 
recover  the  modified  Riccati  equation  (4.7).  Denote  the  initial  solution  for  Q  (i.e.,  the 
LQG  solution)  by  Q®,  corresponding  to  the  constant  projection  matrix:  r®  =  =  0. 

Equation  (5.60)  is  suitable  if  we  wish  to  solve  (4.7)  only  once,  for  constant  t  =  where 
T*  has  been  computed  based  on  the  initial  vajues  of  Q  and  P.  However,  we  also  wish  to 
solve  for  t,  so  the  outer  loop  of  the  algorithm  will  be  updating  the  projection  matrix, 
giving  r',  i  =  1,2,....  Therefore,  on  the  iteration  we  need  to  continuously  deform  a 
known  solution  Q*~^  into  Q‘  based  on  our  knowledge  of  t*  and  t*~^.  To  that  end, 
consider  the  function, 

ffg(vecQ',vecQ‘^,a)  =  {A®  A)  vecQ‘  +  vec  Vi  —  vec((!l'V2~'^‘^)  (5.61) 

+  r*‘  -  ‘vec(Q‘  *  ^^2  ' 

+  a[r**vec(Cl*V2  "  ^Q'^) 

-  r*‘  -  'vec(Q’ "  V2 "  *Q‘ "  ^'^)]  =  0 


and  note  that  it  satisfies  the  conditions  in  (5.55).  For  a  =  0,  the  solution 
(vecQ*  ~  *,vecCl' “  *  )  from  the  previous  iteration  satisfies  (5.61),  and  for  o  =  1,  (5.61)  is 
equivalent  to  (5.58).  Now  we  may  develop  the  initial  condition  problem  to  solve  by 
tahing  the  derivative  of  this  function  with  respect  to  a,  as  follows: 


^  dvecQ'  ^  dvecCl^ 

a  da  dvecQ'  ^  da  5vecQ’^ 


(5.62) 


By  the  definition  of  Q,  we  have: 


^2^  =  i  vec(C(?'  +  V,/)  =  (/  ®  0  vecQ'  =  ■  (/  ® 


(5.63) 


Therefore, 


dot  dyecQ 


^+(/®c’'") - ^  +-^  =  0 

avecQ* 


(5.64) 


70 


or: 


dvecQ'  _ 
da 


dj^ 

da 


5vecQ* 


9H^Q 

dvecO*^ 


where 


=  [r-vec(Q'V2  "  -  t**  "  Vec(Q'  “  ^  Vj  "  ^Q* "  ^'^)]* 


dvecQ* 


(AeA)'^ 


9H^Q 

5vecQ‘^ 


-\vr  ■O'’'  ®  /),„,  +  (/  ®  Vj  -  ■Q''’)](/  -  aT-‘r 


(5.65) 


The  properties  used  to  take  these  partial  derivatives  may  all  be  found  in  Graham 
(1981).  The  operator  denoted  by  the  subscript  “(n)”  is  also  borrowed  iErom  Graham 
(1981,  p.  71).  It  has  the  effect  of  reordering  the  rows  of  a  matrix  by  taking  the  first  row 
followed  by  each  subsequent  n‘**  row,  then  the  second  row  followed  by  each  subsequent 
n***  row,  etc.  Here,  n  is  the  order  of  the  (augmented)  plant  (i.e.,  A  E  R"*'"). 

In  summary,  we  may  solve  the  Riccati  equation  (4.7)  for  Q*  (given  a  fixed  t*)  by 
treating  the  previous  solution,  Q'~^  ,  as  the  initial  condition  at  o  =  0,  then  integrating 
(5.65)  over  the  interval  from  o  =  0  to  a  =  1.  The  starting  value,  Q®,  is  the  LQG 
solution  for  Q  (i.e.,  the  solution  for  r  j_  =  0). 

The  same  procedure  may  be  applied  to  the  other  Riccati  equation,  (4.8).  From 
the  definition  of  (4.6)  we  have: 

=  £  vec(BTpl  +  fl„T)  =  £  (7  g  pT)  ^  (5  66) 

which  leads  to: 
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The  Lyapunov  equations,  (4.9)-(4.10),  do  not  require  a  homotopy  solution  for 
fixed  r*,  because;  (a)  the  variables  Q  and  P  can  be  treated  as  fixed  constants,  since  they 
have  already  been  solved  for  on  the  first  pass,  and  (b)  the  term  involving  the  projection 
matrix  is  not  a  function  of  the  only  remaining  variable,  Q  (or  P),  and  may  therefore  be 
treated  as  part  of  the  constant  term.  That  is,  having  already  solved  the  modified 
Riccati  equations  for  Q  and  P,  the  modified  Lyapimov  equations  may  be  treated  as 
standard  Lyapunov  equations  and  be  solved  immediately  for  Q  and  P. 

The  entire  homotopy  algorithm  for  the  solution  of  the  optimal  reduced-order 
control  problem  (without  multiplicative  white  noise)  is  now  stated. 

(1)  Perform  initializations: 

(a)  Let  =  I„  (i.e.,  r  ®  =  0  and  t*®  =  0). 

(b)  Solve  Riccati  equations  (4.7)  amd  (4.8)  for  the  initial  values,  and  P®. 

(c)  Solve  Lyapunov  equations  (4.9)  and  (4.10)  for  initial  values  and  P^. 

(d)  Set  the  iteration  number:  i  =  0. 

(Note:  this  initialization  corresponds  to  the  standard  LQG  solution.) 
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Beginning  of  main  loop; 

(2)  Increment  the  iteration  number:  i*~i  +  1. 

(3)  Compute  the  optimal  projection  matrix,  t*  (based  on  and  exaw:tly 

as  in  step  (3)  of  the  algorithm  in  section  5.4. 

(4)  Test  for  convergence  of  Q‘“*,  Q‘~^,  P’~^  and  t’  (condition  to  leave 

mciin  loop): 

(a)  Compute  relative  errors,  as  follows. 

eg4||r.h.s.of(4.16)^/|V,|U  (5.50) 

ep^|r.li.s.of(4.17)|l,/lX.|U 

^||r.h.s.  of  (4.18)|l,/laV,-'Q-"|, 

op^lr.h.s.of(4.19)yi9>-f  fir's’ll. 

(b)  If  max{eg,ep,c~,ep}  <  c  (the  tolerance),  go  to  (12). 

Otherwise,  go  to  (5). 

(5)  Compute  and  9'“^  (based  on  Q'~^  and  P*"^)  using  equations  (3.63)  and 
(3.64),  and  compute  r**  using  (5.59). 

(6)  Compute  Q': 

Numerically  integrate  (5.65)  over  the  interval  from  a  =  0  to  a  =  1. 

(7)  Compute  P': 

Numerically  integrate  (5.67)  over  the  interval  from  a  =  0  to  a  =  1. 

(8)  Compute  Q*  and  9**  (based  on  Q*  and  P*)  using  equations  (3.63)  and  (3.64), 
and  update  Ap  and  Aq  using  equations  (3.65). 

(9)  Compute  Q: 

Solve  Lyapunov  equation  (4.9)  for  Q  by  treating  the  term  invloving  t  as 
a  constant,  effectively  adding  it  to  the  other  constant  term. 
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(10)  Compute  P: 

Solve  Lyapunov  equation  (4.10)  for  P  by  treating  the  term  invloving 
as  a  constant,  effectively  adding  it  to  the  other  constant  term. 

(11)  Go  to  (4). 

(12)  Compute  G  and  F  using  equations  (4.14). 

(13)  Compute  A^,  and  using  equations  (4.15). 
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6.  Evaluation  of  Designs 


The  controller  design  principles  developed  in  Chapters  2-5  were  appUed  to  an 
FDLTI  model  of  a  simply  supported  plate  in  order  to  study  their  effectiveness.  This 
chapter  describes  the  complete  control  system  model,  then  evaluates  the  comparative 
value  of  the  various  parameter-robust  and  optimal  reduced-order  controller  designs 
based  on  this  model.  Chapter  7  applies  the  analytical  techniques  of  this  chapter  to 
solve  the  problems  of  plant  uncertainty  and  high  controller  order  that  were  encountered 
in  the  actual  hardware. 

6.1  Problem  Description 

The  simply  supported  plate  is  made  of  cold-rolled  steel.  It  is  rectangular,  of 
dimensions  .5m  x  .6m  x  2.9mm,  and  has  attached  to  it  twelve  accelerometer  sensors  on 
one  side  and  two  point  force  actuators  on  the  other  side  —  one  for  the  control  input  and 
one  for  extern2d  disturbance  generation.  The  accelerometers  are  lightweight 
piezoelectric  devices  mounted  to  the  plate  with  wax  and  are  placed  in  a  3x4 
rectangular  array  with  locations  devised  to  assure  observability  of  the  first  twelve 
vibratioucd  modes  of  the  plate.  The  presence  of  more  sensors  than  observed  modes  (i.e., 
spatial  oversampling)  is  a  redundancy  that  tends  to  provide  greater  accuracy  in  the 
sensor  measurements.  The  actuators  are  electromagnetic  shakers  with  a  magnitude 
frequency  response  from  voltage  input  command  to  force  output  that  is  essenti£dly 
constant  over  the  frequency  range  of  interest.  They  are  placed  in  locations  coinciding 
with  two  of  the  sensor  positions  and  chosen  such  that  most  of  the  authority  of  these 
shakers  is  on  the  first  two  modes.  Also,  the  shakers  aire  located  near  node  lines  of 
modes  three  and  four,  with  the  intention  of  limiting  the  “spillover”  of  the  control  signal 
into  the  higher  frequency  modes.  For  the  experiments  carried  out  in  Chapter  7,  the 
disturbance  signal  was  produced  by  a  function  generator.  The  zero-order  hold  control 
signal  yvas  sent  through  a  second-order  low  pass  filter  before  reaching  the  control  shaker 
for  the  purpose  of  smoothing  the  signal  and  preventing  aliasing.  For  more  details 
concerning  the  hardware  configuration  of  the  simply  supported  plate  experiment,  see 
Rubenstein  (1991). 

The  plate  with  actuators  and  sensors  is  described  by  the  standard  state-space 

model. 
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ip(*)  =  9i2hpW 

yit)  =  CpXpii)  +  DpU(i)  +  [921  <722l«'p(<) 


(6.1) 


Expanding  the  matrices  in  (6.1),  we  de6ne  the  modal  model, 


®p(0 

0 

I 

^p{i) 

+ 

0 

u{t)  + 

0 

0 

w^{t) 

®«(*) 

-2ZQ. 

$ 

^11 

$ 

•m; 

0 

W2it) 

(6.2) 


y(t)=[  -n*  -2zn] 


®p(0 


w^it) 

W2{t) 


The  time  variables  and  their  dimensions  are  defined  as  follows: 


Xp  €  R"” . Modal  position  states 

x„  €  R"*" . Modal  velocity  states 

u  €  R™ . Control  input 

Wi  €  R** . Process  noise  (disturbance  input) 

Wj  €  R^ . Sensor  noise 

y  e  R* . Modal  acceleration  measurements 


For  the  simply  supported  plate  experiment,  m  =  p  =  1,  because  there  is  only  one 
control  shaker  and  one  disturbance  shaker.  The  number  of  modeled  modes,  n„,  varies 
according  to  the  order  of  the  plant  for  which  we  wish  to  design  a  controller,  although  it 
is  limited  to  nine  (the  total  number  of  modes  thus  far  identified).  Since  we  are 
modeling  n„,  modes,  the  number  of  modal  acceleration  measurements  is  /  =  n„.  This 
number  would  be  limited  to  twelve  (the  number  of  accelerometers),  should  there  be 
more  than  twelve  identified  modes  available.  The  excess  accelerometers  provide 
redundant  information  about  the  spatial  accelerations  fi'om  which  a  more  reUable  least 
squares  solution  to  the  modal  accelerations  is  computed. 

The  quantities  fl  and  Z  are  diagonal  matrices  of  the  modal  natural  frequencies 
and  damping  ratios,  respectively.  That  is. 
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n^cliag{tJi,u>2,...,u>  } 

fn 

=  2ffdiag{/„/2,...,/  } 

m 

Z  i  diag{C„Ca . C„  1 

m 


(6.3) 


where  the  mod2d  natural  frequencies  and  damping  ratios  are  given  in  Table  6.1. 


Table  6.1:  Natural  Frequencies  and  Damping  Ratios  for  the  First  Nine  Modes 


i 

fi  (Hz) 

— 

c. 

1 

49.447 

0.007722826 

2 

108.96 

0.01171460 

3 

130.25 

0.008318498 

4 

188.53 

0.002731109 

5 

203.25 

0.002725023 

6 

265.62 

0.002387554 

7 

285.78 

0.001224449 

8 

326.08 

0.001321583 

9 

338.30 

0.002220888 

The  column  vectors  and  are  the  modeshapes  corresponding  to  the  control 
and  disturbance  inputs,  respectively.  They  indicate  ths  relative  effect  each  shaker  has 
on  the  modal  amplitudes  and  are  a  function  of  the  shaker  locations.  These  modeshapes 
are  related  to  the  sensor  modeshape  matrix,  $,  introduced  in  Chapter  5  [see  (5.11)]. 
According  to  the  data  provided  by  the  identification  procedure,  the  matrix  $  for  a 
model  of  the  first  nine  modes  is  given  (to  four  places  past  the  decimal)  by: 
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$  = 


0.2068 

-0.5350 

0.4716 

0.4290 

-0.7181 

0.1104 

0.3072 

-0.4771 

-0.5241 

0.4001 

-0.2333 

0.8233 

0.7033 

-0.3062 

0.1353 

0.4829 

-0.1404 

-0.9120 

0.4468 

0.2215 

0.7428 

0.7033 

0.3664 

0.1091 

0.4480 

0.2853 

-0.8733 

0.2803 

0.4538 

0.5085 

0.4365 

0.7018 

0.0766 

0.1884 

0.4729 

-0.5331 

-0.6245 

-0.4200 

-0.3708 

-0.1516 

-0.6814 

0.4496 

0.6138 

-0.4607 

-0.3951 

-0.1619 

0.3468 

-0.6213 

-0.0554 

0.5246 

0.6389 

0.4227 

0.3599 

-0.6038 

0.6210 

0.3438 

-0.6273 

0.2185 

0.5573 

0.6715 

-0.4771 

0.3848 

-0.6201 

1.0755 

-0.5294 

-0.3369 

0.2210 

-0.7149 

0.4368 

-1.0680 

-0.4109 

-0.3970 

-0.5257  -0.6133  -0.2620 
-0.0798  0.6420  -0.4180 
0.4640  -0.6078  -0.2086 
0.5346  -0.2604  0.4473 
0.1319  0.3292  0.6070 
-0.6515  -0.2957  0.3680 
0.6039  0.3454  -0.3710 
0.1343  -0.4100  -0.5521 
-0.6411  0.3428  -0.3926 
-0.7840  0.6357  0.1768 
-0.1769  -0.8063  0.3551 
0.7980  0.5814  0.1820 


(6.4) 


The  twelve  rows  of  $  are  the  modeshapes  for  the  twelve  accelerometer  locations.  The 
control  and  disturbance  shakers  are  each  colocated  with  one  of  the  accelerometers,  so 
and  are  given  by  two  of  the  rows  of  Denote  the  row  of  #  by  Then 

=  K  =  (6-5) 

The  expression  for  g22  was  developed  in  section  5.1  [see  (5.14b)].  When  fewer  them  nine 
modes  are  modeled  (i.e.,  <  9),  the  appropriate  columns  of  $  are  eliminated.  For 

example,  in  order  to  model  only  the  first  four  modes,  we  take  $  to  be  equal  to  the  first 
four  columns  of  $  in  (6.4). 

The  control  input,  u,  passes  through  a  second-order  smoothing  filter  before 
reaching  the  shaker.  If  we  denote  the  control  command  entering  the  filter  by  u^,  a 
state-space  model  of  the  smoothing  filter  is: 

x,{t)  =  Aix,{t)  -i-  (6.6) 

U{t)  =  C[X,{t) 
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where 


W/  =  2jr  •  120,  =  .707 


The  disturbance  lOj  is  modeled  as  narrowband  noise  centered  about  60  Hz. 
Conceptually,  this  was  accomplished  by  passing  the  fictitious  white  noise  Vtiriable  v 
through  a  second-order  noise  shaping  filter.  The  state-space  model  of  that  shaping  filter 
is  as  follows; 

i  JO  =  (6.8) 

u{i)  = 

where 


=  27r  •  60,  Ctt.  =  -1 


Controllers  are  designed  firom  the  augmented  plant  model,  comprised  of  the 
interconnected  plate,  smoothing  filter,  and  noise  shaping  filter  dynamics  —  (6.1),  (6.6), 
and  (6.8).  The  n‘**-order  augmented  plant  is  described  by  the  following  state  and 
output  equations: 

x{t)  =  Ax{t)  -h  -f-  Giu;(t)  (6.9) 

y{t)  =  Cx{t)  -1-  G2w{t) 
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where 


(6.10) 


Ap  BpCf  0  0  §12 

A=  0  A,  0  ,5=B,  ,  G,  =  0  0 

0  0  0  0 

G  =  ^  Cp  DpCi  g2iCw  ]>  G2  =  ^  0  ir22  j 

The  noise  vector  w{t)  is  assumed  to  have  identity  covariance,  so  the  correlations  among 
the  individual  elements  are  specified  by  Gi  and  G2,  and  the  relative  intensities  of  the 
process  and  sensor  noise  are  adjusted  by  means  of  which  scales  §22  (see  (6.2)],  As 
stated  in  section  5.1,  ^12  ==  since  the  sensor  noise  {W2)  only  affects  the  output 
equation.  The  presence  of  the  smoothing  filter  eliminates  any  feedthrough  from  the 
controller  command  signal  to  the  outputs.  However,  for  many  of  the  experiments 
simulated  in  this  chapter,  the  smoothing  filter  was  ignored  in  order  to  thoroughly  study 
the  robustness  design  methods  on  a  simple  model  before  stepping  up  to  a  full  scale 
model.  Without  the  filter,  the  Xj  states  are  eliminated  and  =  u. 

Figure  6.1  shows  the  frequency  responses  of  the  plate  (for  n„,  =  4),  smoothing 
filter,  and  noise  shaping  filter  just  described.  The  frequency  response  of  the  plate  is 
represented  by  a  plot  of  its  maximum  singular  value.  That  enables  the  magnitude 
response  of  the  4x1  transfer  fxmction  matrix  to  be  expressed  by  a  single  curve.  The 
response  of  the  noise  shaping  filter  is  scaled  so  as  to  be  visible  in  the  magnitude  range  of 
the  plot.  This  curve  also  represents  the  frequency  content  of  the  eissumed  disturbance, 
although  the  actual  scaling  depends  on  the  intensity  of  that  disturbance  and  is 
considered  a  design  parameter.  The  120  Hz  cutofr  frequency  of  the  smoothing  filter  was 
chosen  to  allow  adequate  control  of  the  observed  modes  (modes  1-4)  while  minimizing 
excitation  of  the  higher  frequency  imobserved  modes. 
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Plant,  Noise-Shaping  Filter,  and  Smoothing  Filter 


Frequency  (Hz) 


Figure  6.1:  Frequency  Responses  of  Augmented  Plaint  Components  for  4- Mode  Model 
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6.2  Compaiison  of  Tradeoff  for  Different  Methods 


In  order  to  speed  up  computations  and  eliminate  unnecessary  complications,  the 
robust  control  methods  of  chapters  2  and  3  were  applied  to  a  single  mode  model  of  the 
plate  (i.e,  Um  =  1)>  smoothing  filter  was  omitted  from  the  augmented  plant 

model  of  (6.9)-(6.10).  This  continuous-time  augmented  plant  model  has  order  n  =  4. 


An  LQG  controller  was  chosen  for  the  baseline  design,  and  after  closing  the  loop 
it  was  found  that  system  stability  was  particularly  sensitive  to  two  of  the  plant 
parameters  —  the  natural  frequency,  Wj,  and  the  scalair  control  input  eigenvector, 
(corresponding  to  the  control  shaker  location).  The  cost  functional  weighting  matrices 
were  chosen  by  combining  the  modal  energy  penalty  technique  of  section  5.1  with  the 
disturbance  cancellation  algorithm  of  section  5.2  [using  (5.23)],  yielding: 


0 

0 

0 

0 

0 

1 

0 

0 

II 

■^12  — 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1.0002 

1.0001 

(6.11) 


Since  there  is  only  one  modal  output,  there  is  no  need  to  worry  about  relative 
intensities  of  multiple  sensor  noise  sources.  Therefore,  5122  simplified  to: 

922  —  Then,  as  usual,  the  covariance  matrices  were  given  by: 

V,  =  GiG/,  =  V^^  =  Gfio7  (6.12) 

The  sensitivity  of  system  stability  with  respect  to  Wj  and  varied  greatly  with  the 
selection  of  p„.  For  relatively  large  (i.e.,  relatively  small  disturbance  intensity), 
stability  is  more  sensitive  to  and  less  sensitive  to  whereas  for  relatively  small  />„, 
the  reverse  is  true. 

Both  Ccises  —  sensitivity  to  Wi  and  —  were  studied  in  order  to  test  the 
effectiveness  of  the  various  parameter-robustness  methods  under  different  conditions.  If 
one  method  were  clearly  and  consistently  superior  to  the  others,  it  would  stand  out 
during  these  tests.  Since  the  same  auxiliary  output  weighting  technique  is  common  to 
LQG/PRE,  the  frequency-domain  method  of  Chapter  2,  and  the  time-domain  method  of 
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Chapter  3,  the  comparative  effectiveness  of  these  methods  was  studied  by  improving 
robustness  of  system  stability  using  only  auxiliary  input  noise.  Separately,  only 
auxiliary  output  weighting  was  used.  Then  the  effect  of  combining  the  two  was 
demonstrated. 


A.  Uncertainty  in  Natural  Frequency 

When  the  cost  matrices  in  (6.11)  are  combined  with  the  noise  intensity 
parameter,  =  10  ~  ®,  a  +3%  error  in  the  natural  frequency  is  enough  to  drive  the 
closed-loop  system  (with  LQG  controller)  unstable.  In  order  to  evaluate  the  robust 
controller  designs,  a  specification  was  made  to  raise  the  stability  margin  to  ±  10%,  and 
the  resulting  loss  in  performance  was  compared  for  four  different  methods: 

Auxiliary  inputs  only: 

(1)  White  noise  (w.n.) 

(2)  Frequency-shaped  noise  (f.s.n.) 

(3)  Multiplicative  white  noise  (m.w.n.) 

Auxiliary  outputs  only: 

(4)  Auxiliary  output  penalty  (aux.  output) 


Note  that  the  term  “stability  margin”,  as  used  here  and  in  the  remainder  of  this  thesis, 
does  not  denote  a  gain  or  phase  margin  in  the  sense  of  clzissical  control  theory.  Rather, 
it  is  an  abbreviation  for  parameter  stability  margin  and  represents  the  maximum 
amount  a  parameter  may  deviate  from  its  nominal  (i.e.,  assumed)  value  without  causing 
system  instability. 

The  natural  frequency  appears  in  both  the  A-  and  C-matrices,  so  methods  (1), 
(2),  and  (4)  require  that  we  factor  both  AA  and  AC  to  obtain  [see  (1.8)]: 


0 


0 


M,  =  l,  iV,  =  iV,=[2.05  1  0  0  ] 


(6.13) 


The  natural  frequency  appears  in  matrix  elements  —  and  —  2Ciu;i  in  both  A  and  C, 
so  it  is  impossible  to  find  a  relative  scaling  for  the  corresponding  elements  of  AA  and 
AC  that  will  hold  for  any  size  deviation  of  Wj.  For  a  -1-5%  deviation  in  Wj 
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(or  —  2Ciu;i),  there  is  a  +  2.05  x  5%  deviation  in  (or  —  This  2.05  ratio  very 
nearly  holds  over  the  entire  parameter  range  of  interest,  hence  the  values  of  and  N^. 
in  (6.13). 

In  applying  method  (2),  the  noise  shaping  filter  Hj[s)  was  computed  directly 
from  equation  (2.11),  without  a  low-order  approximation.  The  full-order  shaping  filter 
did  not  raise  the  controller  order  excessively,  because  the  original  plant  order  was  so 
low.  Also,  due  to  the  structure  of  the  parameter  uncertainties,  the  same  noise  shaping 
filter  wcis  used  for  Hc{s),  so  no  additional  dynamics  were  necessary  for  that  filter. 
Therefore  the  controller  order  for  method  (2)  was  =  8,  as  opposed  to  =  4  for  the 
other  methods. 

The  design  parameters  needed  to  just  meet  the  ±  10%  stability  margin 
specification  are  as  follows: 


(1)  White  noise . 

(2)  Frequency-shaped  noise ... 

(3)  Multiplicative  white  noise 


/*«  =  Me  =  400 

(scale  Hg  and  by  a  factor  of  1100) 
7i  =  -64, 


Ai  — 


0  0  0  0 

2.05  1  0  0 

0  0  0  0 

0  0  0  0 


C,  =[2.05  1  0  0  ] 


(4)  Auxiliary  output  penalty . =  p^  =  600 


Figure  6.2  shows  the  results  of  the  performance/stability  robustness  tradeoff  for 
all  four  robust  controllers,  compared  with  the  baseline  LQG  controller.  Both  frequency- 
shaped  noise  and  multiplicative  white  noise  provided  more  suitable  auxiliary  input 
models  than  the  white  noise  of  LQG/PRE,  although  the  multiplicative  white  noise 
design  was  clearly  the  best  performer  of  the  three.  The  auxiliary  output  p>enalty  design 
gave  up  the  least  performance  of  all  in  achieving  the  stability  robustness  objective.  At 
the  nominal  value  of  lOi,  that  design,  and  the  multiplicative  white  noise  design,  are 
nearly  optimal  in  performance. 
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Parameter  Deviation  (percent/100) 


Figure  6.2:  Steady  State  Performance  and  Stability  Tradeoff 
for  Uncertainty  in  Natural  Frequency 


The  experiment  depicted  in  Figure  6.2  wiis  based  on  the  2issumption  of  steady 
state  process  noise  —  the  premise  of  LQG  optimal  control.  This  comparison  was 
repeated  with  an  impulse  applied  at  the  process  noise  port  [i.e.  the  signal  v  in  (6.10)]  in 
order  to  simulate  the  effect  of  transient  disturbamces  on  the  designs.  The  results  are 
shown  in  Figure  6.3.  The  LQG  curve  no  longer  represents  optimality  at  zero  parameter 
deviation,  rather  it  is  only  an  approximation  to  optimality.  In  fact,  designs  (3)  and  (4) 
both  outperformed  the  LQG  controller  in  the  presence  of  an  impulse  disturbance.  The 
multiplicative  white  noise  design  performs  relatively  somewhat  better  when  the 
disturbance  is  an  impulse,  as  was  expected.  The  signal  levels,  and  therefore  the 
auxiliary  input  eimplitudes,  vary  greatly  over  the  course  of  time,  and  the  multiplicative 
white  noise  model  takes  advantage  of  this.  On  the  other  hand,  the  finite-energy 
disturbemce  input  conflicts  with  the  premise  of  the  frequency^shaped  noise  design,  and  it 
performs  relatively  worse. 


B.  Uncertainty  in  Eigenvector 

In  order  to  make  the  closed-loop  system  sensitive  to  the  control  input 
eigenvector,  the  sensor  noise  intensity  parameter,  p„,  was  lowered  to 

p„  =  l/(4.9  X 10*).  That  resulted  in  the  same  baseline  stability  margin  as  in  the  natural 
frequency  uncertainty  problem  just  discussed.  A  -1-3%  deviation  in  (in  the 
matrix  only)  destabilized  the  closed-loop  system  with  LQG  controller.  The  matrix 
Z?p  =  was  left  unaltered  for  this  study,  because  hardware  experimentation  showed 
uncertainty  in  to  be  a  particular  problem. 

The  control  input  eigenvector  appears  in  the  second  row  of  the  augmented  B- 
column  vector,  so  the  obvious  choice  of  a  factorization  for  AB  was: 


0 


0 


iV6  =  l 


(6.14) 


The  same  stability  margin  specification  was  made  as  before  —  to  improve  that  margin 
to  ±  10%.  The  following  design  parameters  enabled  the  four  robustness  methods  to 
meet  that  specification: 
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Robustness  of  Impulse  Response  to  Uncty.  in  Natural  Frequency 


Figure  6.3:  Transient  Performance  and  Stability  Tradeoff 
for  Uncertainty  in  Natural  Frequency 
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(1)  White  noise . 

(2)  Frequency-shaped  noise ... 

(3)  Multiplicative  white  noise 


(scale  H^,  by  a  factor  of  .15) 
71  =  1.44x10-®, 


Si  = 


0 

1 

0 

0 


(4)  Auxiliary  output  penalty . pt  =  3.4 

The  steady  state  disturbance  restilts  for  this  problem  are  shown  in  Figure  6.4. 
The  auxiliary  output  penalty  method,  rather  than  being  the  most  effective,  was  by  far 
the  worst  performer  this  time.  Among  the  three  auxiliary  input  models,  multiplicative 
white  noise  once  again  gave  very  good  results,  but  provided  no  improvement  over  the 
simple  white  noise  model.  Surprisingly,  the  frequency-shaped  noise  model  led  to  a 
somewhat  greater  sacrifice  in  performance  than  the  white  noise  model. 

Plots  of  the  impulse  response  costs  for  this  example  are  shown  in  Figure  6.5. 
These  results  are  not  significantly  different  from  those  of  the  steady  state  disturbance 
case. 

C.  Results  for  Different  Factorizations 

Up  to  now,  only  the  individual  components  of  the  parameter  robustness  methods 
under  study  have  been  compared.  A  logical  procedure  for  choosing  among  LQG/PRE, 
the  frequency-domain  method  of  Chapter  2,  and  the  time-domain  method  of  Chapter  3 
is  to  determine  the  corresponding  auxiliary  input  model  that  gives  the  best 
performance,  then  to  combine  that  auxiliary  input  model  with  the  auxiliary  output 
penalty  conunon  to  all  three  methods.  Once  a  required  stability  margin  has  been 
specified,  the  auxiliary  input  noise  intensity  (through  ^6)  —  assuming  we  are 

using  LQG/PRE)  and  the  auxiliary  output  penalty  (through  p^,  and  may  be 

applied  individually  or  combined  in  any  number  of  different  proportions  to  just  meet 
the  stability  specification.  For  any  single  independent  parameter  uncertainty,  adjusting 
the  relative  magnitude  of  the  p-  and  p-scalars  is  equivalent  to  choosing  different 
factorizations  for  AA,  AB,  and  AC,  giving  the  corresponding  nonzero  elements  of  either 
the  M-  or  the  AT-matrices  a  relatively  greater  magnitude.  Of  course,  when  more  than 
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^  Tradeoff  for  Uncertainty  in  Eigenvector 

6.5:  Transient  Performance  and  Stabrbty  iraaeon 


one  independent  imcertainty  is  present,  the  factorizations  must  be  adjusted  in  order  to 
exercise  every  degree  of  freedom  available.  Analysis  of  a  few  different  factorizations 
should  reveal  one  that  is  very  nearly  the  best  possible  for  any  given  uncertainty. 

This  technique  was  applied  to  the  eigenvector  uncertainty  problem  in  section 
6.2B.  First,  the  white  noise  model  was  chosen  for  the  auxiliary  inputs,  because  of  the 
good  perform2ince  that  results  and  because  of  the  simplicity  of  design.  Then,  holding 
M(,  and  Nf,  constant,  the  ±  10%  stability  margin  was  attained  using  several  different 
combinations  of  values  for  and  p/,.  Those  values  £ire  detailed  in  Table  6.2  for  eight 
different  controller  designs.  The  performance/stability  results  of  each  design  are  plotted 


Table  6.2:  Controller  Design  Parameters  Corresponding  to  Eight  Different  Factorizations 


Case  Number 

1 

2 

3 

4 

5 

6 

B 

8 

f^b 

0 

10 

20 

30 

40 

50 

60 

70 

Pb 

3.4 

0.9 

0.5 

0.3 

0.15 

0.1 

0.05 

0 

in  Figure  6.6,  where  the  top  curve  represents  Case  1,  and  the  curve  for  each  successive 
decrease  in  nominal  cost  represents  the  next  higher  case  number.  These  results  are 
typiccil  of  a  those  from  a  number  of  such  experiments,  including  analysis  of  controllers 
baised  on  a  multiplicative  white  noise  model  of  the  auxiliary  inputs.  Normally,  either 
the  auxiliary  input  modeling  or  auxiliary  output  penalty  alone  provide  virtuedly  the  best 
overall  performance  possible,  although  neither  one  does  so  consistently.  So  far,  no 
signiffcant  improvement  has  been  achieved  by  combining  auxiliary  inputs  and  outputs. 

Blelloch  and  Mingori  (1990)  point  out  that  one  particular  factorization  has  a 
certcdn  intuitive  value  for  the  problem  of  multiple  natural  frequency  uncertainties 
(actuaJly  for  uncertainties  in  the  —  Wj*  elements  of  the  A-matrix).  Namely,  by 
factoring  A  A  such  that  the  nonzero  elements  of  and  are  equal,  the  auxiliary 
outputs  lead  to  a  penalty  on  the  elastic  strain  energy  of  the  flexible  structure.  However, 
this  approach  to  factoring  AA  h2is  no  bearing  on  the  relative  emphaisis  to  be  placed  on 
the  auxiliary  inputs  versus  the  auxiliary  outputs.  It  only  decides  what  relative 
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Cost:J''(l/2) 


Steady  State  Cost  for  Assorted  LQG/PRE  Factorizations 


Figure  6.6:  Comparative  Performance  for  Several  Different  LQG/PRE  Factorizations 
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importance  should  be  placed  on  the  various  parameter  rmcertainties.  Also,  no  results 
were  given  to  draw  comparisons  with  other  factorizations. 

6.3  ltd  vs.  Stratonovich  Noise 

In  the  examples  considered  in  the  previous  section,  the  multiplicative  white  noise 
models  were  interpreted  in  the  sense  of  Ito.  Recall  from  section  3.5  iiiat  in  order  to 
interpret  multiplicative  white  noise  as  Stratonovich  noise,  we  need  only  to  make  the 
following  substitutions  in  the  controller  design  equations: 

A  *-  A  +  ^  E  'TiAi^ ,  (3.73) 

^  i  =  l 

B  B  +  ^  t  liAiBi , 

c  ^  c  +  ^  f: 

«=i 

The  Stratonovich  noise  interpretation  was  applied  to  the  natural  frequency  uncertainty 
problem  of  section  6.2A  (using  atixiliary  input  modeling  only)  so  as  to  compare  the 
controller  performance  with  that  of  the  ltd  noise  design  already  discussed.  For  a  ±  10% 
stability  m2U'gin  requirement,  we  have  the  following  matrix  substitutions  for  A  and  C: 

(6.15) 

0  0 

0  0 

0  0 

0  0 


0 

70334 

1 

0  0  -142120  -75.398 


A  A 


A  *-  A  + 


0  0 

0.656  0.32 
0  0 

0  0 


where  on  the  right-hand  side  of  (6.15): 


A  = 


0  1  0 

-96525  -  4.7987  0 

0  0  0 
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and 


C  *•  C  -k-  5 

C  4-  C  +  [  .656  .32  0  0  ] 


(6.16) 


where  on  the  right-hand  side  of  (6.16): 

C  =  [  -96525  -4.7987  0  .70334  ] 

The  stability  specification  was  met  with  auxiliary  input  noise  of  such  a  low  intensity 
that  the  modifications  to  A  and  C  were  insignificant.  The  difference  in  the  stability 
and  perform<ince  characteristics  of  the  closed-loop  systems  for  the  two  noise 
interpretations  was  therefore  not  discernible. 


6.4  Reduced-Order  Controller 


A  much  higher  order  model  was  used  to  investigate  the  benefits  of  optimal 
reduced-order  control.  This  study  employed  a  4-mode  model  of  the  plate,  along  with 
the  second-order  smoothing  filter  and  second-order  disturbance  shaping  filter  to  create  a 
twelfth-order  augmented  plant.  The  design  objectives  for  the  reduced-order  controllers 
were  threefold:  (1)  to  provide  15  dB  of  rejection  at  the  assumed  disturbance  center 
frequency  (60  Hz),  (2)  to  minimize  the  increase  in  the  linear  quadratic  cost  over  that  of 
the  full-order  design,  and  (3)  to  provide  some  transient  suppression  along  with  the 
steady  state  rejection. 

The  disturbance  cancellation  method  of  section  5.2  was  found  to  provide  over 
15  dB  of  rejection  at  60  Hz  for  a  full-order  controller.  Combining  60  Hz  cancellation 
with  a  penalty  on  the  modal  energies  supplied  some  transient  suppression  as  well.  The 
cost  functional  matrices  which  resulted  were  similar  to  those  used  for  the  robustness 
studies: 


Hi  =  diag{n^/4x4, 0,0, 0,1.0002},  Hj  =  1, 


Rx2  — 


On  xi 

1.0001 


L 


(6.17) 


where  the  4x4  natural  frequency  matrix  fl  is  defined  in  (6.3).  As  before  the  noise 
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intensity  parameter  was  chosen  to  be  p„  =  10~®.  The  simple  model  for  g22  was  used; 
^22  =  Then  V2,  and  V’12  were  computed  from  (6.12). 

A  full-order  controller  was  designed  for  this  model,  then  its  performance, 
frequency  response,  and  time  response  characteristics  were  compared  with  optimal 
controllers  of  orders  4,  3,  and  2.  Table  6.3  quantifies  the  comparative  performance  of 
all  four  controllers. 

Table  6.3;  LQ  Cost  and  Disturbaince  Rejection  vs.  Controller  Order 


Order 

12 

4 

3 

2 

Rejection 

15.9  dB 

17.9  dB 

14.8  dB 

5.3  dB 

LQ  Cost 

1576 

1623 

1633 

2753 

Approximately  15  dB  of  rejection  at  60  Hz  is  retained  for  a  controller  order  as  low  ais  3, 
and  very  little  rejection  is  lost  by  substituting  this  3'^-order  controller  for  a  12‘**-order 
controller.  However,  the  2**'*-order  controller  is  mairkedly  worse  in  performance.  Note 
how  the  lineau:  quadratic  cost  rises  with  each  reduction  in  controller  order.  On  the 
transition  from  a  3‘^‘*-order  to  a  2"‘*-order  controller,  the  increaise  in  cost  is  very  steep. 

Figure  6.7  shows  the  magnitude  frequency  responses  of  the  first  modal 
acceleration  to  the  disturbaince  shaiker  input  for  the  open-loop  system  aind  three  closed- 
loop  designs.  The  3'^-order  controller  appears  to  be  the  design  of  choice.  We  see  here 
how  closely  the  closed-loop  system  behavior  with  this  controller  resembles  that  of  the 
system  with  full-order  controller.  When  the  controller  order  is  restricted  to  less  than 
three,  however,  the  optimadity  (in  the  full-order  sense)  of  the  design  begins  to  breadc 
down.  All  three  controllers  provide  some  tramsient  suppression  by  lowering  the  open- 
loop  peak  at  49  Hz  due  to  the  lightly  daimped  first  mode  at  that  frequency.  However, 
the  2"‘*-order  controller  fails  to  produce  a  notch  at  the  60  Hz  disturbance  center 
frequency.  By  subtracting  the  open-loop  response  from  each  of  the  closed-loop 
responses,  we  airrive  at  the  disturbance  rejection  plots  in  Figure  6.8.  The  vertical  line  is 
drawn  at  60  Hz  to  indicate  where  the  15  dB  of  rejection  is  desired.  The  2"^-order 
controller  is  not  able  to  simultaneously  dampen  the  first  mode  and  provide  good 
rejection  at  60  Hz.  The  frequency  response  plots  of  the  controllers  in  Figure  6.9 
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1  Acceleration  (dB) 


Frequency  Responses  for  Full-  and  Reduced-Order  Compensators 


Figure  6.7:  Frequency  Responses  for  Full-  and  Reduced-Order  Compensators 
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Disturbance  Rejection  for  Full-  and  Reduced-Order  Compensators 


Figure  6.8:  Disturbance  Rejection  for  Full-  and  Reduced-Order  Compensators 
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demonstrate  the  cause  of  this  limitation.  These  curves  represent  the  response  of  the 
tr<msfer  function  from  the  first  modal  acceleration  input  to  the  control  output.  The 
response  of  the  2"‘*-order  controller  can  approach  the  full-order  response  only  at  the 
asymptotes.  The  rapid  change  of  the  full-order  response  in  the  49-60  Hz  region  cannot 
be  matched  at  the  same  time  by  a  transfer  function  with  only  two  poles.  In  order  to 
compcire  the  time  responses  of  the  closed-loop  system  with  12***-  and  3”*-order 
controllers,  a  60  Hz  disturbance  was  applied  and  the  first  modal  acceleration  was 
measured.  These  results  are  shown  in  Figure  6.10.  Again,  we  see  how  little 
performance  is  sacrificed  to  achieve  this  dramatic  reduction  in  controller  order. 
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Frequency  Response  of  Full-  and  Reduced-Order  Compensators 
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Figure  6.9:  Frequency  Response  of  Full-  and  Reduced-Order  Compensators 


Mode  1  Acceleration 
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Time  Response  to  60  Hz  Disturbance 
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Figure  6.10:  Time  Response  for  Full-  and  Reduced-Order  Compensators 
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7.  Application  to  Simply  Supported  Plate 


7.1  Stability  Robustness  Problem  and  its  Solution 

For  experimentation  with  the  simply  supported  plate  hardware,  a  sinusoidal 
60  Hz  disturbance  was  applied  at  the  disturbance  shaker.  When  standard  LQG 
controllers  were  implemented,  very  little  disturbance  rejection  could  be  accomplished. 
Large  controller  authorities  (in  the  form  of  a  small  penalty,  R^.,  on  the  control  input) 
resulted  in  system  instability  due  modeling  errors  of  the  plate.  In  order  to  determine 
possible  sources  of  the  modeling  errors,  the  control  penalty  was  gradually  lowered  auid 
the  LQG  controller  was  repeatedly  redesigned  until  instability  resulted.  Then  the 
sensitivity  of  the  model  to  deviations  in  its  parameters  was  analyzed.  The  augmented 
plant  model  from  which  the  controllers  were  designed  was  the  same  4-mode,  12“*-order 
model  described  in  section  6.4.  The  covariance  matrices  were  also  the  same,  with  the 
minor  exception  that  was  computed  as  in  (6.2),  }rielding  a  slightly  different  value  for 
V].  In  order  to  keep  matters  simple,  direct  60  Hz  disturbance  cancellation  was  not 
attempted.  Rather,  a  penalty  was  applied  to  the  control  input  and  the  modal  energy  of 
the  first  mode  only.  The  plant,  cost,  and  covariance  matrices  were  then  translated  to 
zero-order  hold  equivalent  form,  and  the  LQG  controllers  were  designed  directly  in 
discrete-time.  The  following  (continuous-time)  cost  functional  matrices  resulted  in 
instability  of  the  simply  sup{}orted  plate: 

=diag{u;,,0, 0,0, 1,0,0,0,0, 0,0,0},  ^2  =  2x10®,  Ri2  =  0  (7.1) 

Continuous-time  analysis  of  the  closed-loop  system  model  with  LQG  controller 
designed  from  these  cost  matrices  revealed  that  the  system  was  robust  to  all  of  the 
plate’s  damping  ratios  and  modal  frequencies  except  one.  A  —  5%  or  a  6%  deviation 
in  the  second  natural  frequency  wsis  found  to  destabilize  the  system  model.  The 
nominal  natural  frequency  for  the  second  mode  (from  Table  6.1)  is  approximately  109 
Hz.  Therefore,  if  the  plant  model  were  otherwise  accurate  and  the  actual  second 
naturid  frequency  of  the  plate  were  anywhere  outside  the  rmge  103.5-115.5  Hz, 
instability  would  result.  In  order  to  correct  this  problem,  a  requirement  was  placed  on 
the  stability  margin  with  respect  to  0/2  —  to  improve  the  margin  to  di:  10%.  Three 
different  modified  LQG  designs  were  applied  to  just  meet  this  specification:  (1)  white 
noise  auxiliary  input  modeling,  (2)  multiplicative  white  noise  input  modeling,  and 
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(3)  auxiliary  output  penalty.  Since  the  simultaneous  application  of  auxiliary  inputs  and 
outputs  was  found  in  Chapter  6  to  have  little  success,  no  combination  of  the  two  was 
attempted.  Also,  frequency-shaped  auxiliary  input  noise  was  not  considered,  because  of 
its  relative  ineffectiveness  during  experimentation  with  the  model  and  because  it  would 
result  in  a  controller  of  order  greater  than  twelve,  meaning  that  the  controller  would  be 
too  slow  for  implementation  without  using  some  kind  of  order  reduction. 

The  factorization  used  to  compensate  for  the  natural  frequency  uncertainty  is 
analogous  to  the  one  used  in  section  6.2A.  for  the  1-mode  model: 


M,= 

Osxi 

1 

,  M,= 

0 

1 

0 

Ofixl 

0 

Ar.  =  jv.  =  |  0 


2.05  0]),3  1  Ojxs] 


(7.2) 


The  design  parameters  that  the  three  robust  designs  used  to  meet  the  ±  10%  stability 
margin  requirement  are: 


(1)  White  noise . /i„  =  /ig  =  .22 

(2)  Multiplicative  white  noise . 7j  =  2.89, 


Ai  — 


2.05 


0  2.05 


O5X12 

0  0  0 

^6x12 
X  12 

0  0  0 

^2x12 


1  O1X6  h 


1  X6 


(3)  Auxiliary  output  penalty . =  p^  =  7x10* 


The  performance/stability  tradeoff  results  for  all  three  cmdidate  designs  is  depicted  in 
Figure  7.1,  along  with  the  LQG  results.  Even  though  the  white  noise  auxiliary  ifiput 
model  gave  very  poor  nominal  performance  in  the  1-mode  natural  frequency  uncertainty 
example  of  section  6.2A.,  that  same  method  of  robust  design  sacrificed  virtually  no 
nominal  performance  here  and  was  clearly  the  best  performer  overall.  The 
multiplicative  white  noise  design  did  provide  a  larger  stability  mMgin  for  positive 
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Figure  7.1:  Performance/Stability  Robustness  Tradeoff  for  Uncertainty 
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d<  ations  in  a;2,  but  the  natural  frequency  is  surely  known  to  within  28%,  so  the 
additional  stability  margin  to  +  34%  is  of  no  value. 

The  white  noise  auxiliary  input  model  requires  only  that  we  make  the  following 
two  substitutions  for  the  original  covariance  matrices: 

(7-3) 

Then  we  use  LQG  to  arrive  at  a  different  Kalman  filter  (but  the  same  regulator  gains). 
This  parameter  robust  controller  design  was  implemented  in  the  hardware  and  did  in 
fact  stabilize  the  system.  The  measured  time  responses  of  mode  1  and  mode  2 
accelerations  to  the  60  Hz  disturbance  are  shown  in  Figures  7.2  and  7.3,  respectively. 
After  the  loop  was  closed,  the  LQG  controller  rejected  the  response  in  mode  1,  but  only 
temporarily,  as  mode  2  was  driven  unstable.  The  robust  controller  stabilized  mode  2 
(and  the  entire  system)  while  giving  up  only  a  small  amotmt  of  rejection  in  mode  1. 

7.2  Effectiveness  of  Reduced-Order  Controllers 

Robust  optimeil  controllers  of  orders  3  and  4  were  designed  using  the  same  white 
noise  auxiliary  input  model  and  the  same  cost  and  covariance  matrices  used  to  design 
the  full-order  robust  controller  discussed  above.  For  this  level  of  controller  authority, 
the  iterative  relaocation  method  of  section  5.4  proved  to  be  sufficient  for  solving  the 
optimal  projection  equations.  Since  a  sampled-data  version  of  the  optimal  projection 
equations  was  not  available,  the  reduced-order  controllers  were  designed  in  continuous¬ 
time,  then  discretized  by  means  of  a  bilinear  transformation.  The  3”*-  and  4“‘-order 
controllers  significantly  reduced  the  computational  delay  caused  by  the  full-order 
controller,  so  the  one-sample-interval  delay  was  ignored  on  the  first  attempt.  Then,  in 
each  case,  a  second  controller  was  designed  based  on  a  2°**-order  Fade  approximation  of 
the  delay.  This  time  delay  model  raised  the  order  of  the  augmented  plant  from  12  to 
14. 

The  sample  rates  for  all  three  controller  orders  are  given  in  Table  7.1. 


Mode  1  Acceleration 


Mode  1  Response  to  60  Hz  Disturbance 


Figure  7.2:  Robust  Control  —  Mode  1  Response  to  60  Hz  Disturbance 
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Mode  2  Acceleration 


8 


Mode  2  Response  to  60  Hz  Disturbance 


Time  (sec) 


Figure  7.3:  Robust  Control  —  Mode  2  Response  to  60  Hz  Disturbance 
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Table  7.1:  Sample  Rates  for  Full-  and  Reduced-Order  Controllers 


Controller  Order 

12 

4 

3 

Sample  Rate  (Hz) 

1997 

3615 

4023 

The  improvement  of  the  reduced-order  sample  rates  over  that  of  the  full-order  design 
was  not  as  dramatic  as  one  might  expect,  because  the  full-order  controller  W2is  able  to 
tcike  advantage  of  the  structure  of  the  modal  state-space  realization  of  the  plant.  It 
exploited  the  spcirseness  of  the  block-diagonal  system  matrix  and  the  spairseness  of  the 
output  matrix  to  speed  up  the  Kalman  filter  considerably. 

In  order  to  examine  the  effect  of  computational  time  delay  on  control  system 
performance,  an  unmodeled  delay  of  one  sample  interval  was  introduced  into  the  full- 
order  robust  controller  by  preventing  the  Kadman  filter  from  predicting  one  step  ahead. 
The  delay  did  not  cause  instability,  but  controller  performance  suffered  somewhat. 
Figure  7.4  shows  the  comparative  responses  of  the  first  mode  to  a  60  Hz  disturbance  for 
full-order  controllers  with  and  without  the  time  delay. 

Even  though  continuous-time  analysis  predicted  the  3"*-order  controller  would 
stabilize  the  system,  it  did  not,  regardless  of  whether  the  computational  delay  was 
modeled.  The  4‘**-order  controller  stabilized  the  system  only  when  the  delay  was 
modeled,  and  in  that  Ccise  gave  quite  good  performance.  Again,  a  60  Hz  disturbance 
was  applied  to  the  plate,  and  the  responses  of  modes  1  emd  2  were  me2isured.  In  Figures 
7.5  and  7.6,  these  results  are  compared  with  those  of  the  f\ill-order  controller.  The 
4**'-order  controller  does  give  up  a  small  amount  of  disturbance  rejection,  but  increases 
the  sample  rate  significantly.  The  significance  of  this  experiment  is  its  demonstration 
that  parameter  robust  and  reduced-order  control  can  be  accomplished  simultaneously  to 
stabilize  a  flexible  structure  and  reject  disturbances  with  minimal  computational  delay. 
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Mode  1  Acceleration 


Mode  1  Response  to  60  Hz  Disturbance 


Time  (sec) 


Figure  7.4:  Response  of  Full- Order  Robust  Controllers  With  and  Without  Time  Delay 
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Mode  1  Acceleration 


Mode  1  Response  to  60  Hz  Disturbance 


Time  (sec) 


Figure  7.5:  Robust  Reduced-Order  Control  —  Mode  1  Response  to  60  Hz  Disturbance 
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Mode  2  Acceleration 


8 


Mode  2  Response  to  60  Hz  Disturbance 
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8.  Conclusions 


As  stated  in  the  introduction,  the  objectives  of  this  work  were  to  comp2ue  a 
number  of  different  parameter  robustness  techniques  based  on  the  LQG/PRE  error 
model,  to  investigate  the  limitations  on  controUer  order  reduction,  and  to  evaluate  the 
combined  robust  minimal  order  design. 

8.1  Robustness 

The  few  parameter  robust  control  problems  considered  in  Chapters  6  and  7  serve 
as  a  counterexample  to  the  proposition  that  any  one  of  the  robust  design  techniques 
studied  is  consistently  superior  to  the  others.  In  fact,  the  white  noise,  multiplicative 
white  noise,  and  auxiliary  output  designs  each  provided  the  best  performance  for  at 
least  one  problem.  The  best  performer  in  one  case  was  found  to  be  the  worst  performer, 
by  fax,  in  another.  Also,  the  frequency-shaped  noise  design  proved  to  be  a  contender  at 
times.  Therefore,  no  strong  conclusions  can  be  drawn  on  the  basis  of  performance  alone. 

Although  the  combination  of  an  auxiliary  input  and  auxiliary  output  model 
provides  greater  flexibility  in  the  design,  experience  seems  to  indicate  that  only  rarely  is 
any  advantage  gained  from  such  a  combination.  Even  then,  the  advantage  appears  to 
be  very  slight.  This  result  is  useful,  since  the  elimination  of  this  flexibility  from  the 
design  allows  us  to  eliminate  the  complexity  of  choosing  among  an  infinite  number  of 
A/LiV-factorizations  for  each  parameter  uncertainty.  Excellent  performance  and 
stability  characteristics  are  attained  consistently  by  choosing  the  better  of  the  two 
extremes  —  auxiliary  inputs  only  or  auxiliary  outputs  only.  Assuming  this 
simplification,  we  may  discuss  the  comparative  merits  of  the  time-domain  method,  the 
frequency-domain  method,  and  LQG/PRE  in  complete  generality  by  considering  the 
three  auxiliary  input  models  and  the  auxiliary  output  model  each  on  its  own. 

The  multiplicative  white  noise  model  for  the  auxiliary  inputs  was  the  most 
consistent  of  the  four  models  in  providing  good  performance  for  a  specified  stability 
margin.  Relative  to  the  other  models,  this  one  tends  to  show  added  improvement  when 
the  signal  amplitudes  of  the  plant  vary  greatly  over  time,  such  as  is  the  case  when  the 
plant  is  subjected  to  transient  disturbances.  However,  the  performance  provided  by  this 
model  was  repeatedly  matched  or  beaten  by  simpler  models. 
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The  frequency-shaped  noise  model  has  not  proven  to  be  superior  for  <iny  example 
studied  thus  far.  Although  it  has  in  some  cases  resulted  in  a  controller  with  better 
performance  than  one  designed  using  a  white  noise  model,  the  added  design  complexity 
would  seldom  justify  the  small  advantage  this  model  might  provide.  The  intent  of 
frequency  shaping  is  to  model  the  low  and  high  frequency  rolloff  of  the  auxiliary  input 
frequency  response.  However,  the  frequency  content  of  white  noise  outside  the 
passband  of  the  system  (which  poorly  models  reality)  is  greatly  attenuated  and 
apparently  has  no  significant  harmful  effect.  In  the  presence  of  transient  disturbances, 
the  frequency-shaped  noise  model  tends  to  impair  performance,  since  its  design  relies  on 
the  frequency  spectrum  of  an  assumed  steady  state  disturbance. 

The  white  noise  auxiliary  input  model  and  the  auxiliary  output  penalty  each 
yield  poor  performance  in  some  cases.  However,  one  or  the  other  consistently  provides 
very  good  performance,  and  the  simplicity  of  their  associated  controller  design  methods 
has  important  advantages.  Both  models  require  only  the  modification  of  cost  or 
covariance  matrices  in  a  standard  LQG  design.  Therefore,  they  do  not  require  the 
iterative  design  computations  of  the  multiplicative  white  noise  model,  nor  do  they 
increase  controller  order,  as  does  the  frequency-shaped  noise  model.  The  rapidity  of 
controller  design  allows  more  effort  to  be  devoted  to  analysis,  which  is  crucial  when 
there  are  multiple  independent  parameter  uncertainties. 


8.2  ControUer  Order 

For  a  12‘**-order  augmented  plant  (14**’-order  with  the  computational  delay 
model)  we  were  able  to  design  a  3'^-order  controller  in  theory  and  a  4***-order  controller 
in  practice  without  sacrificing  a  great  deal  of  performance.  Instead  of  determining  the 
minimal  controller  order  that  is  practical  by  successively  designing  optimal  controllers 
of  many  different  orders,  we  may  examine  the  frequency  response  of  the  full-order 
controller  to  find  out  what  order  a  controller  must  have  to  display  the  significant 
features  of  that  response.  That  order  serves  as  a  good  first  iteration  to  the  minimum 
controller  order  desirable.  The  addition  of  more  modes  to  the  model  would  not  tend  to 
raise  the  order  of  a  controller  necessary  to  provide  a  good  response,  because  any 
additional  modeled  modes  would  necessarily  be  less  dominant.  This  effect  was 
demonstrated  by  the  addition  of  the  2"‘^-order  computational  delay  model  to  the  plant 


112 


in  Chapter  7.  Not  only  did  the  minimum  controller  order  fail  to  rise,  it  fell  due  to  the 
more  precise  modeling  of  the  plant. 


8.3  Parameter  Robust  Reduced-Order  Design 

The  application  of  the  optimal  projection  equations  to  the  auxiliary  input  and 
output  models  of  LQG/PRE  is  new  and  proved  to  be  very  successful.  Its  application  to 
the  LQG/PRE  error  model  in  general  makes  these  methods  of  parameter  robust 
controller  design  much  more  powerful.  The  reduction  in  computational  delay  makes 
active  vibration  control  possible  when  it  otherwise  may  not  have  been.  Also,  optimal 
reduced-order  design  allows  us  to  model  more  d}mamics  to  give  the  controller  a  more 
accurate  picture  of  the  plant,  thus  reducing  another  source  of  uncertainty  —  unmodeled 
dynamics.  The  additional  d3mamics  may  include  more  disturbance  modeling, 
smoothing  filter  modeling,  aind  time  delay  modeling.  In  the  robust  reduced-order  design 
of  Chapter  7,  the  sacrifice  in  performance  may  have  been  reduced  or  eliminated  by 
giving  up  some  of  the  improvement  in  sample  rate  in  favor  of  modeling  more  high 
frequency  modes.  This  experiment  was  not  carried  out,  because  it  would  have  required 
rewiring  of  the  smoothing  filter,  which  was  designed  for  a  4-mode  model.  However,  the 
4‘^-order  robust  controller  as  designed  did  stabilize  the  system  and  provide  a  significant 
amount  of  disturbance  rejection. 


8.4  Directions  for  Further  Study 

The  most  conspicuous  shortcoming  of  the  design  methods  and  algorithms 
presented  in  this  study  is  the  lack  of  a  method  of  designing  optimal  reduced-order  and 
multiplicative  white  noise  based  controllers  directly  in  discrete-time.  This  deficiency 
results  in  the  introduction  of  two  additional  sources  of  error.  Firstly,  the  design  of  an 
optimal  continuous-time  controller  followed  by  the  transformation  of  that  controller  into 
a  discrete-time  equivalent  is  a  suboptimal  process.  Secondly,  for  the  continuous-time 
design  an  approximation  is  necessary  to  create  a  finite-dimensional  model  of  the 
inevitable  computationad  delay.  In  a  continuous-time  augmented  plant  model,  the 
computational  delay  is  not  only  imperfectly  modeled,  it  may  also  require  the  addition  of 
high  order  dynamics  in  order  to  be  modeled  adequately,  particularly  when  the  plant  is 
multi-input  multi-output.  The  solution  of  the  optimal  reduced-order  control  problem 
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for  sampled-data  systems  is  of  particiilar  interest,  since  it  would  have  widespread 
application  in  LQG-based  design. 
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1.0  INTRODUCTION 


This  investigation  is  concerned  with  effective  state  estimation  of  a  system 
driven  by  an  unknown  nonGaussian  input  with  additive  white  Gaussian  noise,  and 
observed  by  measurements  containing  feedthrough  of  the  same  nonGaussian  input 
and  corrupted  by  additional  white  Gaussian  noise.  The  motivation  behind  this 
work  is  the  problem  of  robust  modal  control  of  large,  multiple  input  multiple 
output  (MIMO),  heavily  damped  structures.  The  scope  of  the  problem  involves 
system  identification  of  the  system  parameters,  optimal  estimation  of  the  system 
states,  and  robust  control  of  the  heavily  damped  modes  of  the  system.  This 
investigation  will  deal  specifically  with  the  state  estimation  portion  of  the  problem. 

In  an  earlier  study  [1],  Kalman  filter  estimation  techniques  [2],  [3]  are 
developed  for  a  lightly  damped,  simply  supported  plate.  The  input  forcing  function 
is  assumed  to  be  narrowband  Gaussian,  which  is  adequately  modeled  by  passing 
white  Gaussian  noise  through  a  linear,  time-invariant  filter.  Thus,  knowledge  of 
the  power  spectral  density  of  the  narrowband  process  is  sufficient  for  the  design  of 
an  adequate  estimator.  Since  the  plant  and  measurement  equations  for  this  system 
are  linear,  and  the  inputs  and  plant  and  measurement  disturbances  are  Gaussian, 
the  Kalman  filter  will  provide  the  optimum  minimiim  mean-square  error  estimate 
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of  the  system  states  [4]. 

Although  the  Gaussian  assumption  for  modeling  many  types  of  inputs  and 
noise  processes  is  valid  in  a  wide  range  of  applications,  in  practice  it  may  not  be  a 
good  assumption  for  some  signal  models.  For  instance,  also  considered  in  [1]  aire 
large,  heavily  damped  structures  modeled  by  a  plant  with  complex  modes.  The 
input  to  the  plant  is  imknown,  but  its  frequency  characteristics  are  known  and  the 
input  is  always  present.  The  signal  model  chosen  to  represent  this  input  is  a 
stochastic  FM  signal,  generated  by  frequency  modulating  a  sinusoid  with  a 
Gaussian  process.  This  stochastic  FM  signal  has  a  highly  nonGaussian  probability 
density  function.  The  power  spectral  density  of  a  signal  of  this  type  appears 
similar  to  that  of  a  Gaussian  narrowband  signal.  An  estimator  design  based  solely 
on  the  power  spectral  density  of  a  nonGaussian  signal  assumed  to  be  a  Gaussian 
narrowband  signal  may  provide  very  poor  estimates. 

Estimation  techniques  need  to  be  developed  that  can  cope  effectively  with  the 
nonGaussian  nature  of  certain  signals.  One  such  approach  is  the  Gaussian  sum 
technique,  developed  by  Alspach  [6-8].  The  density  function  of  each  nonGaussian 
process  of  the  system  is  approximated  by  a  weighted  sum  of  Gaussian  density 
functions.  The  conditional  density  of  the  state  given  the  available  measurement 
sequence,  necessary  in  the  Kalman  filter  development,  is  updated  using  the 
Gaussian  sum  approximations  and  Bayes’  nile. 

A  serious  limitation  in  Alspach’s  approach  is  that  the  number  of  Gaussian 
terms  used  to  approximate  the  density  functions  increases  at  each  time  iteration. 
A  modified  approach  is  required  to  alleviate  this  limitation.  A  modified  estimation 
structure  is  developed  based  on  an  adaptive  Kalman  filter  scheme  first  presented 
by  Magill  [16],  and  extended  by  Moose  [23].  Essentially,  a  parameter  vector  is  used 
to  uniquely  describe  each  Gaussian  term  in  the  estimator.  The  parameter  vector  is 
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restricted  to  be  randomly  chosen  from  the  same  finite  set  of  known  values  at  each 
iteration.  By  using  a  nonGaussian  signal  model  in  conjunction  with  the  modified 
formulation  of  the  Gaussian  sum  estimator,  the  number  of  Gaussian  terms  at  each 
iteration  of  the  estimator  will  be  fixed,  thereby  avoiding  the  growing  memory 
problem. 

The  modified  estimation  algorithm  is  termed  the  modified  Gaussian  sum 
adaptive  filter  and  forms  the  basis  for  the  nonGaussian  state  estimation  in  this 
investigation.  An  algorithm  similar  to  this  one  that  does  not  make  reference  to  the 
Gaussian  sum  characteristic  is  known  in  the  literature  as  the  parallel  processing 
algorithm  [4],  or  the  multiple  model  algorithm  [5].  Both  the  Gaussian  sum  and 
modified  Gaussian  sum  algorithms  have  been  implemented  in  a  variety  of 
important  engineering  applications  such  as  phase  and  frequency  estimation  [36], 
geophysical  field  navigation  [37],  maneuvering  target  tracking  [24],  and  specification 
of  route  widths  for  air  traffic  controllers  [20]. 

Several  contributions  to  the  field  of  applied  estimation  theory  are  made  firom 
this  investigation.  These  include: 

1.  The  development  of  a  modified  Gaussian  sum  eilgorithm  with  nongrowing 
memory  based  on  a  nonGaussian  signal  model  with  a  Gaussian  sum 
probability  density  function.  Parameters  from  this  model  are  used  directly  in 
the  modified  Gaussian  sum  adaptive  filter  structure. 

2.  A  comparison  between  the  Gaussian  smn  (GS)  filter  of  [6]  and  the  modified 
Gaussian  sum  (MGS)  adaptive  filter.  The  two  cure  similcir,  but  the  comparison 
shows  the  MGS  adaptive  filter  to  be  a  good  improvement  to  the  GS  filter. 
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3.  An  examination  of  a  necessary  condition  for  effective  MGS  estimation.  This 
condition  provides  a  simple  test  to  determine  if  the  MGS  adaptive  filter  will 
work  properly  for  a  given  system. 

4.  An  alternate  configuration  of  the  MGS  adaptive  filter  when  the  necessary 
condition  of  3  above  is  not  met.  This  configuration  is  applied  in  several  ways, 
and  each  is  evaluated  on  a  performance  basis. 

5.  Two  methods  of  monitoring  and  updating  key  parameters  of  the  MGS 
adaptive  filter.  These  allow  the  estimator  to  react  to  changes  in  the  input 
signal  level  which  cause  the  signal  to  be  nonstationary  over  long  periods  of 
time. 

The  dissertation  begins  with  a  literature  review  in  Chapter  2.  This  chapter 
covers  previous  work  and  apphcations  in  the  areas  of  Gaussian  sum  estimation, 
adaptive  Kalman  filtering,  and  nonGaussian  estimation.  Chapter  3  begins  with  the 
development  of  the  Gaussian  s\mi  estimator.  Then,  a  detailed  description  of  the 
nonGaussian  signal  model  and  the  alternate  formulation  of  the  Gaussian  sum 
density  approximation  is  given.  A  curve  fitting  procedure  used  to  find  the  initial 
modified  Gaussian  sum  parameter  vector  is  outlined.  The  MGS  adaptive  filter  is 
developed  for  a  gener2d  system  and  a  simulation  example  is  given.  Several 
differences  between  the  MGS  and  GS  algorithms  are  examined.  Chapter  4 
describes  the  modal  system,  the  nonGaussian  input  model,  and  the  MGS  adaptive 
filter  based  on  this  system.  A  simulation  example  using  the  MGS  adaptive  filter  is 
given  and  the  results  are  compared  to  those  produced  from  an  augmented  Kalman 
filter  based  on  an  augmented  system  model  assuming  a  narrowbcind  Gaussian  input 
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signal.  A  necessary  condition  for  effective  MGS  estimation  is  derived.  Alternate 
estimation  procedures  axe  developed  to  compensate  for  situations  when  this 
condition  is  not  met.  Several  configurations  are  simulated  and  their  performance 
results  are  analyzed  and  compared.  Chapter  5  discusses  two  methods  of  monitoring 
and  updating  key  parameters  of  the  MGS  adaptive  filter.  Simulation  results  are 
analyzed  to  investigate  the  performance  of  these  methods.  Finally,  Chapter  6  gives 
the  conclusions  and  outlines  suggested  directions  for  future  investigations. 
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2.0  PREVIOUS  RESEARCH  AND  RELATED  WORK 


This  chapter  presents  a  review  of  the  literature  relevant  to  this  investigation. 
It  begins  with  a  discussion  of  the  Gaussian  sum  estimation  technique.  Many 
examples  of  its  use  in  a  wide  range  of  applications  are  given.  Next,  an  eulaptive 
Kalman  filtering  method  is  outlined  along  with  several  modifications.  Several 
examples  of  the  various  applications  of  this  method  axe  presented.  Finally,  an 
overview  of  alternate  nonGaussian  filtering  and  its  applications  is  discussed. 


2.1  Gaussian  Sum  Estimation 

An  estimation  technique,  applicable  to  both  linear  systems  with  nonGaussian 
inputs  and  nonlinear  systems  with  Gaussian  inputs,  has  been  developed  by  Alspach 
[6-8].  In  the  case  of  linear  systems,  the  noise  processes  associated  with  the  plant 
and  measurement  are  assumed  known  and  nonGaussian.  The  probability  density 
function  of  each  noise  process  is  approximated  by  a  Gaussian  sum;  that  is,  a 
weighted  sum  of  Gaussian  density  functions.  The  Gaussian  sum  approximation  is 
written  as 


PREVIOUS  RESEARCH  AND  RELATED  WORK 


6 


(2.1.1) 


M 


Pgs(®)  = 

t=l 


where 


M 

^aj  =  1;  Qfi  >  0  for  i  =  1,2, 

i=l 


M 


(2.1.2) 


and 


(2.1.3) 


For  sufficiently  large  Af,  any  density  function  can  be  closely  approximated  by  a 
Gaussian  sum.  As  long  as  condition  (2.1.2)  holds,  the  Gaussian  sum  is  always  a 
valid  density  function. 

The  basic  filtering  problem  is  to  estimate  the  state  of  the  system  from  the 
current  measurement  sequence  The  “best”  estimate  X/^  is 

found  by  minimizing  or  maximizing  a  particular  performance  criterion,  such  as 
minimizing  the  mean-square  error  between  x^  and  x^.  In  the  Bayesian  approach  to 
estimation,  this  requires  the  use  of  the  a  posteriori,  or  conditional,  density  function 
P(®fc  \Zk)- 

\ 

The  Gaussian  sum  estimator  [6-8]  is  briefly  described  here  amd  is  more  fully 
developed  in  chapter  3.  The  density  functions  of  the  initial  state,  the  plant  noise 
process,  and  measurement  noise  process  are  approximated  by  Gaussian  sum 
densities  of  the  form  of  (2.1.1).  These  densities  are  then  directly  used  to  form  the 
conditional  density  fimction  p(xjt  |  Zf.)  as  a  sum  of  Gaussian  densities.  An 
imfortunate  consequence  of  using  these  Gaussian  sum  densities  is  that  the  number 
of  Gaussian  terms  that  forms  p(z)^  |  Zf.)  increases  at  each  stage  of  the  estimator. 
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This  growing  memory  problem  is  a  serious  limitation  in  Alspach’s  development.  In 
chapter  3,  a  nonGaussian  signal  model  is  developed  and  used  in  conjimction  with  a 
modified  Gaussian  siun  adaptive  filter  that  avoids  the  growing  memory  problem. 

Alspach  has  applied  the  Gaussian  sum  technique  to  a  wide  variety  of 
applications.  In  conjunction  with  Sorenson,  he  applies  the  technique  to  hnear 
systems  with  nonGaussian  noise  [7],  and  to  nonlinear  systems  with  Gaussian  noise 
[8].  In  [9],  [10],  and  with  Scharf  and  Abiri  in  [11],  he  addresses  the  problem  of 
linear  systems  with  Gaussian  noise  processes  having  unknown  covariances.  Joint 
identification,  tracking,  and  prediction  in  a  multi-target,  multi-sensor  environment 
is  considered  in  [13],  and  with  LaGrotta  in  [12].  Alspach  and  Sorenson  [14]  use  the 
Gaussian  sum  technique  in  conjunction  with  proving  the  validity  of  the  separation 
theorem  for  linear,  nonGaussian,  optimal  control  problems.  Alspach  extends  this 
work  in  [15]  to  nonlinear  systems.  Here  he  does  not  use  the  sepeiration  theorem, 
but  certainty  equivalence  control  instead,  producing  a  suboptimal  control  algorithm 
useful  for  off-line  investigations. 

Other  researchers  have  used  the  Gaussian  sum  approach  in  their  work.  Tam 
and  Moore  [36]  developed  a  Gaussian  sum  estimator  using  extended  Kalman  filters 
in  the  problem  of  angle  demodulation.  Dmitriev  and  Shimelevich  [37]  applied  the 
technique  in  determining  the  coordinates  of  a  moving  vehicle  firom  geophysical  field 
measurements.  Sirisena  and  Brown  [38]  apphed  the  algorithm  to  probabilistic  and 
stochastic  load  flow  problems.  Namera  and  Stubberud  [39]  used  the  Gaussian  sum 
approach  in  solving  nonlinear  fixed-point  prediction  problems.  Gauvrit  [40] 
developed  a  Gaussian  sum  filter  to  track  targets  in  clutter  with  unknown  noise 
vctriances.  Tanaka  and  Katayama  [41]  devised  a  robust  Kalmain  filter  that  consists 
of  the  Kalman  filter  plus  MAP  estimates  of  the  selection  parameters  that  specify 
noises  from  the  Gaussian  sum  densities.  Their  filter  is  actually  a  smoother  applied 
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to  a  linear  system  with  Gaussian  sum  noises.  Kitagawa  [42]  modeled  sudden 
changes  of  trend  or  seasonal  components  due  to  the  structural  changes  of  the 
economic  system  by  the  presence  of  outliers  by  a  nonGaussian  model.  He  used 
Gaussian  sum  approximations  of  the  nonGaussian  densities. 


2.2  Adaptive  Kalman  Filtering 


In  a  typical  application,  the  various  parameters  that  describe  the  system  are 
assumed  to  be  known.  Even  if  they  are  time-varying,  the  variation  is  assumed  to 
be  known.  The  parameters  are  then  used  in  the  Kalman  filter  design  as  the  true 
model  of  the  system.  For  many  physical  problems,  the  parameters  may  not  be 
known  exactly  or  may  change  at  unknown  times.  In  such  cases,  it  is  highly 
desirable  to  design  the  filter  to  be  self-learning,  so  that  it  can  adapt  itself  to  the 
particular  situation  at  hand. 

One  solution  to  this  problem  was  formulated  first  by  Magill  [16].  He 
considered  the  problem  of  estimation  of  a  Gaussian  random  process  when  some 
parameters  of  the  process  are  initially  unknown  and  remeiin  constant  with  time. 
The  parameters  are  assumed  to  come  from  a  finite  set  of  known  values.  The 
optimal  adaptive  estimate  is  a  weighted  sum  of  conditional  estimates,  which  are 
formed  by  a  bank  of  Kalman  filters.  Each  Kalman  filter  is  based  on  a  particular 
parameter  set.  The  weighting  coefficients  are  determined  by  a  nonlinear  function 
of  the  measurement  residuals  of  the  filters.  The  measurement  residual  of  the  filter 
possessing  the  parameter  set  that  matches  the  actual  set  will  have  the  smallest 
expected  value  (typically,  zero  mean).  The  residuals  of  all  the  mismatched  filters 
will  be  biased.  Under  the  Gaussian  assumption,  the  probability  of  the  matched 
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filter  will  be  the  largest  among  all  the  filters. 

Along  similar  lines,  Ackerson  and  Fu  [17]  developed  an  adaptive  state 
estimator  for  a  linear  system  operating  in  a  switching  environment.  The  noise 
affecting  the  system  comes  from  a  group  of  Gaussian  densities  acting  one  at  a  time. 
The  transitions  from  one  noise  source  to  the  next  is  determined  by  a  Markov 
transition  probability  matrix.  This  effort  differed  from  that  of  Magill  in  that  the 
parameter  set  describing  the  noise  source  does  not  remain  constant  with  time,  but 
is  allowed  to  switch  from  one  set  to  another  at  random  intervals.  The  adaptive 
estimator  has  the  same  growing  memory  problem  as  that  of  Alspach,  and 
consequently  a  suboptimal  finite  memory  estimator  was  proposed. 

Following  Ackerson  and  Fu,  Jaffer  and  Gupta  [18]  developed  an  adaptive 
estimator  with  a  fixed  number  of  filters  for  the  problem  of  signal  estimation  under 
conditions  of  intermittent  failure  in  the  observations.  Fujita  and  Fukao  [19] 
established  the  validity  of  the  separation  theorem  for  the  problem  of  determining 
an  overall  optimal  control  policy  for  a  linear  system  with  interrupted  observations. 
Bruckner,  Scott,  and  Rea  [20]  extended  the  adaptive  algorithm  of  [17]  to  the 
switching  of  plant  parameters  in  an  air  traffic  control  system. 

A  slightly  different  version  of  the  adaptive  filter  was  introduced  by  Moose  [23]. 
He  modeled  the  variations  in  the  switching  plant  or  switching  environment  by  a 
semi-Markov  process.  Briefly  stated,  a  semi-Markov  process  is  a  probabilistic 
system  that  makes  its  state  transitions  according  to  the  transition  probability 
matrix  of  a  conventional  Markov  process.  However,  the  amount  of  time  spent  in 
state  i  before  the  next  transition  to  state  j  is  a  random  variable  [21],  [22].  By 
incorporating  the  semi-Markov  statistics  into  the  learning  portion  of  the  adaptive 
filter,  the  problem  of  growing  memory  is  completely  eliminated. 

This  technique  was  successfxilly  used  by  Moose  and  Wang  [24]  in  the  switching 
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plant  problem.  Other  applications  include:  modeling  large  scale  depth  variations  of 
submarines,  Moose  [25];  modeling  accelerations  of  a  maneuvering  target  in  three- 
dimensional  space  using  spherical  observations  of  noisy  radar  data,  Gholson  and 
Moose  [26];  incorporating  a  correlated  acceleration  model  in  an  airborne  target 
tracking  system.  Moose,  VanLandingham,  and  McCabe  [27];  passive  underwater 
tracking  using  polar  coordinates,  McCabe  and  Moose  [28];  using  the  correlated 
acceleration  model  in  the  passive  underwater  range  tracking  problem.  Moose  and 
Dailey  [29];  extending  the  algorithm  to  passive  underwater  depth  tracking.  Moose 
and  Godiwala  [30];  modeling  unknown  biases  in  measurement  devices.  Moose, 
Sistanizadeh,  and  Skag^ord  [31];  and  incorporating  a  nonlinear  system  block  in  an 
underwater  tracking  system  which  decouples  the  bearing  and  range  estimators. 
Moose  [32], 

Many  others  have  contributed  to  the  field  of  adaptive  state  estimation  and 
multiple  model  filtering.  Tugnait  and  Haddad  [33],  [34]  studied  state  estimation  in 
linear  systems  with  random  Markovian  noise  statistics.  The  noise  comes  from  a 
group  of  Gaussian  distributions  with  different  means  and  covariances.  The 
transitions  are  determined  by  an  unknown  Markov  transition  probability  matrix. 
Akashi  and  Kumamoto  [35]  devised  an  adaptive  filter  in  which  the  overall  estimate 
is  calculated  using  a  relatively  small  number  of  individual  conditional  estimates 
sampled  at  random  from  a  larger  set  of  individual  conditional  estimates. 

Maybeck  and  Suizu  [46]  and  Tobin  and  Maybeck  [47]  used  multiple  model 
filtering  in  the  problem  of  accmately  tracking  the  a.zimuth  and  elevation  of  a 
highly  maneuverable  airborne  target,  using  outputs  from  a  forward-looking  infrared 
sensor  as  measurements.  Blom  and  Bar-Shalom  [48]  described  a  method  of  timing 
the  merging  of  various  individual  filters  in  an  adaptive  estimator  to  overcome  the 
growing  memory  problem.  Bar- Shalom,  Chang,  and  Blom  [49]  used  the  multiple 
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model  filter  in  the  problem  of  input  estimation,  providing  estimates  of  the 
magnitude  and  onset  time  of  the  input.  Emre  and  Seo  [50]  presented  a  global 
modeling  approach  to  data  association  of  multiple  targets  and  maneuver 
detection/estimation  of  single  targets.  These  two  problems  are  solved 
simultaneously  using  system  identification  techniques,  which  leads  to  a  multiple 
model  estimator  with  a  finite  number  of  terms. 


2.3  Alternate  NonGaussian  Filtering 

Several  alternatives  to  the  Gaussian  sum  and  adaptive  Kalman  filter 
algorithms  have  been  proposed  for  systems  containing  nonGaussian  inputs. 
Mtisreliez  [43]  introduced  an  approximate  nonGaussian  filtering  method  for  linear 
systems.  Instead  of  approximating  the  densities  using  Gaussian  sums,  he  used  a 
nonlinear  score  function  of  the  measurement  residuals  to  produce  the  Kalman  filter 
equations.  The  technique  works  best  however  when  only  some  of  the  random 
processes  are  nonGaussian,  with  the  rest  being  Gaussian.  Masreliez  and  Martin  [44] 
extend  this  approximate  nonGaussian  filtering  method  by  applying  an  influence 
function  of  min-max  robustness  theory  to  replace  the  previously  used  score 
fimction.  Tsai  and  Kurz  [45]  devised  a  robust  Keilman  filter  beised  on  an  m-interval 
polynomial  approximation  method  for  unknown  nonGaussicin  noise.  As  the  score 
function  of  [43]  is  partitioned  into  m  segments,  each  is  better  approximated  with  a 
low-order  polynomial  function  which  is  easier  to  implement  and  adapt  to  the 
density  function  of  the  residual  process. 

Other  robust  estimation  techniques  are  used  by  Kirlin  and  Moghaddamjoo  for 
systems  with  unknown  inputs  and  nonGaussian  measurement  errors.  In  [51]  the 
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input  forcing  function  and  measurement  bias  are  estimated  using  a  moving  data 
window  median  calculation.  The  plant  and  measurement  noise  covariances  are 
found  using  biweights  and  rank  correlation.  In  [52]  they  discussed  adaptive 
estimation  of  unknown  inputs  and  measurement  noise  covariance  using  a  running 
window  curve-fitting  algorithm.  Estimation  of  the  plant  noise  covariance  is 
accomplished  using  an  independent  technique,  based  on  the  residuals  and  a 
stochastic  approximation  method. 
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3.0  NONGAUSSIAN  ESTIMATION 


This  chapter  discusses  the  mathematical  details  of  the  Gaussian  sum  approach 
to  nonGaussian  signal  estimation.  It  begins  with  an  outline  of  Alspach’s 
development  of  the  Gaussian  sum  estimation  technique  [6-8].  A  major  problem 
with  this  technique  is  indicated  which  necessitates  a  modified  approach.  Next,  a 
detailed  description  of  the  nonGaussian  signal  model  is  given  which  leads  to  an 
alternate  formulation  of  the  Gaussian  sum  density  approximation.  The  initial 
parameter  vector  describing  the  density  approximation  is  foimd  using  a  curve 
fitting  procedure.  A  modified  Gaussian  sum  adaptive  filter  is  developed  for  a 
general  discrete-time  system  and  a  simulation  example  is  given.  Finally,  key 
differences  between  the  Gaussian  sum  and  modified  Gaussian  sum  algorithms  are 
examined. 


3.1  Gaussian  Sum  Estimation  Technique 

As  previously  stated  in  chapter  2,  am  estimation  technique,  applicable  to  both 
linear  systems  with  nonGaussian  inputs  2md  nonlinear  systems  with  Gaussian 
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inputs,  has  been  developed  by  Alspach  [6-8].  The  basic  problem  is  to  estimate  the 
state  of  ^  discrete-time  system  from  the  current  measurement  sequence 
Zfc  =  The  “best”  or  optimal  estimate  of  the  system  state 

is  chosen  as  that  estimate  which  minimizes  the  mean-square  error  between  and 
Xfc.  This  results  in  the  conditional  mean  estimate 

^k  =  E[x,,\Z,,\=\  a:fcp(rfc  I  Zfc)  dxfc  (3.1.1) 

J  ~  00 

where  p(ifc  |  is  the  conditional  density  function  of  the  state  given  the  current 
measurement  sequence. 

The  Gaussian  sum  estimator  is  developed  by  Alspach  using  the  following 
system  model 


a;*  =  +  tWfc-i  (3.1.2) 

*3:* -I- Ufc  (3.1.3) 


where  the  initial  state  has  a  Gaussiein  sum  density  of  the  form 


4o 

p(*o)  = 


(3.1.4) 


Assume  that  the  plant  and  measurement  noise  processes,  Wf^  and  Vjt,  are 
statistically  independent,  nonGaussian,  white  noise  sequences  with  Gaussian  sum 
densities 


ri=l 


(3.1.5) 


NONGAUSSIAN  ESTIMATION 


15 


and 


P(«fc)  =  Z  RkJ  (3.1.6) 

m=l 

The  key  idea  of  Gaussian  sum  estimation  is  to  use  (3.1.4)  -  (3.1.6)  in  the 
approximation  of  p(zfc  1  as  a  sum  of  Gaussian  densities.  Using  Bayes’  rule,  this 
conditional  density  function  can  be  determined  recursively  [7]  from 

PiXk\Zk-l)pi2k\Xk) 

TOO 

p{xk\Zk-i)=  Pixk-i\Zk-i)pixk\xk-i)dxt.i  (3.1.8) 

J  —OO 

where  the  normalizing  constant  p{zk  \  .^/t-i)  iii  (3.1.7)  is  given  by 

Pi^k  I  Zk-i)  =  (°°  p{Xk  I  Zk-i)pizk  I  Xk)dXk  (3.1.9) 

The  density  p{zk  \  Xfc)  in  (3.1.7)  is  determined  by  the  measurement  noise  density 
p{vk)  of  (3.1.6)  and  the  measurement  equation  (3.1.3).  Similarly,  the  density 
p{xk  I  Zfc_i)  in  (3.1.8)  is  determined  by  the  plant  noise  density  p{wk)  of  (3.1.5)  and 
the  plant  equation  (3.1.2).  Knowledge  of  p{zk  1 1*),  p{xk  |  Xfc-i),  and  the  initial  state 
density  p(xo)  of  (3.1.4)  determines  p{Xk  \  Zk)  for  all  k.  Alspach  [6],  [7]  derives  two 
additional  theorems  which  are  briefly  stated  here.  The  measurement  update 
theorem  provides  passage  of  p{xk  \  Zk-i)  to  p{xk  |  Zk),  while  the  time  update 
theorem  provides  passage  of  p{xk  |  Zk)  to  p(zfc+i  I  Zk)- 

Meamirement  Update  Theorem 

Suppose  that  the  stage  prediction  density  function  is 
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(3.1.10) 


i'k 

p{xk  I  Zk-i)  =  N[/ilk,.,Pfc,.] 


Then  the  stage  filtering  density  is  given  by 


pMZk)=E^kMPk.Pkj]  (3.1.11) 

i=i 

where  the  mean  and  covariance  terms  are  found  from  the  Kalman  filter 
mecisurement  update  equations 


Pkj  =  P'ki  +  Kkj{zk  -  HkP'k.  -  VkJ  (3.1.12) 

Pk.  =  [I-KkMk]Pk,  (3.1.13) 

Kk.  =  P'k,HliHkP'kMl  +  RkS  (3.1.14) 

The  filtering  density  (3.1.11)  is  the  result  of  a  double  sum  formed  by  the 
product  of  the  prediction  density  (3.1.10)  and  the  measurement  noise  density 
(3.1.6).  The  double  sum  is  rewritten  as  a  single  sum  with  upper  summation  limit 

U  =  i'krk  (3.1.15) 


produced  by  the  product  of  upper  summation  limits  ('k  of  (3.1.10)  and  of  (3.1.6). 
As  a  result,  the  number  of  terms  of  the  filtering  Gaussian  sum  density  grows 
geometrically  at  each  measurement  update  with  a  ratio  equal  to  the  number  of 
Gaussian  stun  terms  of  the  measurement  noise. 

The  Okj  term  of  (3.1.11)  is  foimd  from 


%  =  !' 


o'kjlkJVkj 


X>^kiikJVkj 


i=l 


(3.1.16) 
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where  the  index  of  summation  j  is  defined  as 


j  =  i  +  (m-  1)^; 


m  = 


The  W term  of  (3.1.16)  is  the  Gaussian  probability  density  f\mction 


(3.1.17) 


with  terms  that  are  readily  foimd  from  measurement  update  equations  (3.1.12)  and 
(3.1.14).  The  ctfcy  term  (3.1.16)  is  a  nonlinear  function  of  the  current  measurement 
data,  with  its  denominator  acting  as  a  scale  factor  so  that  the  terms  remain 
bounded  between  0  and  1  at  each  iteration. 


Time  Update  Theorem 

Given  that  the  stage  filtering  density  function  is 

I ZJ  =  (3.1.18) 

>=1 

Then  the  stage  prediction  density  is  given  by 

I  (3.1.19) 

i=I 

where  the  mean  and  covariance  terms  are  found  from  the  Kalman  filter  time 
update  equations 


(3.1.20) 
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The  prediction  density  (3.1.19)  is  the  result  of  a  double  sum  formed  by  the 
product  of  the  filtering  density  (3.1.18)  zmd  the  plant  noise  density  (3.1.5).  The 
double  sum  is  rewritten  as  a  single  sum  with  upper  summation  limit 

(k+x  =  (kqk  (3.1.22) 

produced  by  the  product  of  upper  summation  limits  of  (3.1.18)  and  of  (3.1.5). 
As  a  result,  the  number  of  terms  of  the  prediction  Gaussian  sum  density  grows 
geometrically  at  each  time  update  with  a  ratio  equal  to  the  number  of  Gaussian 
sum  terms  of  the  plant  noise. 

The  aji.4.1.  term  of  (3.1.19)  is  found  from 

^kjPkn  (3.1.23) 


where  the  index  of  summation  i  is  defined  as 


*■  =  >  +  («- 1)^* 


j  =  K-;ik 

n  =  l,...,9fc 


Alspach’s  two  theorems  show  that  for  the  system  defined  by  (3.1.2)  -  (3.1.6) 
the  Gaussian  sum  form  repeats  itself  from  one  stage  to  the  next  so  that  (3.1.10) 
and  (3.1.11)  are  the  general  forms  for  an  arbitrary  stage.  The  measurement  update 
equations  (3.1.12)  -  (3.1.14)  and  time  update  equations  (3.1.20)  and  (3.1.21) 
comprise  a  bank  of  Kalman  filters  operating  in  parallel.  Using  the  Gaussian  sum 
approximation  of  the  conditional  density  function  (3.1.11),  the  overall  state 
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estimate  is  found  as  a  weighted  sum  of  the  individual  conditional  mean  values 
from  each  filter 


Xk  =  E[Xk  \Zk]=  ^OCkjfikj 

i=i 


(3.1.24) 


The  corresponding  conditional  covariance  is 

4 

Pfc  =  E[{xk  -  x/t)(xfc  -  I  Z^]  =  +  (**  “  l^kj)ixk  -  t^kjf]  (3.1.25) 

i=i 

It  is  evident  from  (3.1.15)  and  (3.1.22)  that  the  number  of  Gaussian  terms 
increases  at  each  stage  of  the  estimator.  This  growing  memory  problem  is  a  serious 
limitation  in  Alspach’s  development.  A  modified  approach  is  required  to  alleviate 
this  limitation. 


3.2  NonGaussian  Signal  Model  Development 

Let  u  be  a  random  noise  process  or  random  input  signed  with  a  nonGaussian 
density  fimction.  It  can  be  modeled  as  the  sum  of  two  statistically  independent 
random  processes 


u  =  b  +  n  (3.2.1) 

The  first  term,  b,  is  a  semi-Markov  process  with  state  transitions  governed  by  the 
transition  probability  matrix  of  a  conventional  Markov  process.  Markov  processes 
have  the  property  that  a  transition  is  made  at  every  time  instant.  The  transition 

ao 
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may  return  the  process  to  the  state  it  previously  occupied,  but  a  transition  occurs 
nevertheless.  However,  in  the  semi-Markov  case,  the  amount  of  time  between 
transitions  is  a  random  variable  [22].  The  value  of  b  is  randomly  selected  from  a 
fixed  set  of  discrete  values,  characterized  by  a  delta  probability  density  function 

p{b)='£PAh-h,)  (3,2.2) 

«=1 


with 


M 

^Pj  =  l;  Pi>0  for  i  =  1,2,...,M 

i=l 


(3.2.3) 


This  process  can  be  thought  of  as  a  randomly-switching  bias,  eaeh  bias  value  6,- 
having  probability  P,-. 

The  second  term,  n,  is  a  zero  mean  white  Gaussian  process  with  variance  <t*. 
With  both  densities  known,  the  density  function  of  it  can  be  found  using  the 
convolution  relationship  between  u,  n,  and  b  [53] 

P(«)=  [  Pn{^-^)Pb{b)db  (3.2.4) 

J  — OO 

where  p„(it  —  6)  is  the  Gaussian  density  with  n  =  u  —  b 

i(u-6 
e  ^ 


Pn(«  -  6)  =  — 


^2^ 


(3.2.5) 


Substituting  (3.2.2)  and  (3.2.5)  into  the  convolution  integral  (3.2.4)  gives 


i=l 


(3.2.6) 
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Interchanging  integration  and  summation 


M 

p(“)= 

i=l 


A 

-oo-\j27r(T„ 


8{h-hi)dh 


Using  the  sifting  property  of  the  delta  function  [54] 

f{a)  =  [  f{x)8{x  —  a)dx 

J  — OO 


the  integral  of  (3.2.7)  is  evaluated  as 


»=i 


or 


p(„)= 

«=i 


(3.2.7) 


(3.2.8) 


(3.2.9) 


(3.2.10) 


Thus,  the  nonGaussian  density  function  of  u  can  be  modeled  as  a  Gaussian  sum. 
The  weight  Pj  of  each  Gaussian  term  is  the  probability  of  the  bias  term.  The 
bias  term  6,-  is  restricted  to  be  randomly  selected  from  the  same  fixed  set  of  bias 
values  at  each  iteration.  Using  this  model  in  conjunction  with  the  modified 
Gaussian  sum  adaptive  filter  developed  in  the  next  section  avoids  the  growing 
memory  problem  of  Alspach’s  development. 

Selecting  the  parameters  P,-,  5,-,  and  <t„  in  (3.2.10)  to  obtain  the  “best” 
approximation  Pqs  to  some  actual  nonGaussian  density  function  is 
accomplished  by  means  of  minimizing  the  L*  norm 
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fOO  M 

II  Pa  -  Pas  II*  =  I  P/i(w)  -  £  Pi  N[6.,  <^1] 

J  -oo  ,=1 


du 


(3.2.11) 


This  curve  fitting  exercise  can  be  done  off-line  using  several  values  of  M  until  a 
suitable  trade-off  between  minimum  norm  and  minimum  M  is  obtained.  Alspach 
[6],  [7]  performed  this  curve  fitting  procedure  using  and  norms  for  a  imiform 
density  and  a  Gamma  density.  It  was  found  that  minimizing  the  norm  resulted 
in  many  fewer  terms  in  the  Gaussian  sum  and  a  considerably  better  looking 
approximation  for  both  densities  compared  to  minimizing  the  norm. 

Figure  1  compares  a  Gamma  density  with  a  four-term  Gaussian  sum  density 
approximation  minimizing  the  norm.  The  Gaussian  sum  curve  is  shown  to  fit 
the  Gamma  curve  reasonably  well.  The  Gamma  density  used  is 


It  >  0 

u  <  0 


(3.2.12) 


Each  term  of  the  Gaussian  sum  has  a  fixed  value  of  <t„  =  1.  Table  1  lists  the 
values  of  Pj  and  6,-  used  in  the  Gaussian  sum. 


Table  1. 

Gaussian  sum  P„  6,-  values,  <7„  =  1. 


i 

Pi 

bi 

1 

0.081 

2.537 

2 

0.432 

2.553 

3 

0.356 

4.555 

4 

0.131 

6.933 
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3.3  Modified  Gaussian  Sum  Adaptive  Filtering 


A  modified  Gaussian  sum  adaptive  filter  is  now  developed  for  a  linear  system 
with  a  deterministic  input  signal  and  nonGaussian  plant  and  meeisurement  noise. 
The  system  is  modeled  in  standard  discrete-time  state-space  form  as 

^Jt+i  =  +  Tufc  -t- 

“'fc  = 

Zk  =  Hxk  -I-  Ufc 
Vk  =  h  +  Tlk 

where  X/t+i  is  the  state  vector 

U/t  is  a  known  deterministic  input 
lUfc  is  the  vector  Gaussian  sum  signal  model  of  the  actual 
nonGaussian  plant  noise  process,  comprised  of  semi- 
Markov  bias  vector  and  zero  mean  white  Gaussian 
noise  mjt  with  covariance  Q 
Zi^  is  the  measurement  vector 

Vfc  is  the  vector  Gaussian  sum  signal  model  of  the  actual 
nonGaussian  measurement  noise  process,  comprised  of 
semi-Markov  bi^ls  vector  and  zero  mean  white  Gaussian 
noise  with  covariance  R 

$,r,^,fr  are  the  respective  constant  trainsition  matrices 

and  the  random  quantities  a^,  m^,  6^,  are  assumed  to  be  mutually 
statistically  independent. 


(3.3.1) 

(3.3.2) 

(3.3.3) 

(3.3.4) 
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The  optimal  estimate  ffc+j  of  the  state  vector  is  found  by  minimizing  the 
mean-square  error  between  ijt-t-i  This  results  in  the  conditional  mean 

estimate 


^fc+i  I  ~  2:*.^jp(xj.+i  I  ^k+i)  ^^k+i  (3.3.5) 

J  —  oo 

where  is  the  current  measurement  sequence  {zi,22i-  •  -i^k+i}-  The  conditional 

density  function  of  (3.3.5)  can  be  written  as  the  ratio  of  the  corresponding  joint  and 
marginal  densities 


=  (3.3.6) 

The  two  bias  vectors  a  and  b  are  explicitly  brought  into  (3.3.6)  by  considering  the 
joint  density  p(3Jfc+i,Zn.+j)  to  be  a  marginal  density  found  from 
p{xk+i,Zk+i,a  =  a„6  =  bj)  by  summing  over  the  a  and  6  terms 


N  M 

?(*.+■  I  Zi+i)  = 


(3.3.7) 


Expanding  (3.3.7)  and  using  p{xk+i-,Zk^i,ai,bj)  as  shorthand  for 
Pixk+i^Zk+i,a  =  ai,b  =  bj)  gives 


N  M 

Z]  1  ^k+i^ai,bj)p{Zk+i:ai,bj) 

I  Zm)  = 
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as 


N  M 


P(^fc+i  I  ^.+x)  =  - 

P(‘^/b+l) 


and  finally 


N  M 


P{^k+i  I  ^fc+i)  =  E  E^a^fc+i  I  ^fc+i,a.,6j)p(a„6,  |  Z^+i) 

i=i  i=\ 


(3.3.8) 


Substituting  (3.3.8)  into  the  conditional  mean  (3.3.5)  results  in 


^fc+i  —  [  ^fc+i| 

J  —  OO 


'  N  M 

E  ^Pi^k+i  I  ■^fc+nO,>6j)p(a„6  I  Zi^^i) 

i=i  j=i 


^^k+i  (3.3.9) 


Interchanging  integration  and  summation  gives 


N  M 

®fc+i  =  E  E 

,=1  J=1  L-'  -OO 


-OO 

®fc+lp(®fe+l  ( •^ifc+l>0j)^j[) 

J  —  OO 


Pi^ii^j  I  ^k+l)  (3.3.10) 


The  bracketed  integral  in  (3.3.10)  is  the  conditional  mecin  estimate  of  given 
that  a  =  and  b  =  bj,  denoted  by 


®fc+l  ~  ®fc+lP(®fc+l  I  ^k+lf^ii^j)  ^^k+1 


(3.3.11) 


In  effect,  represents  the  estimate  for  the  density  combination  from  the 
two  Gaussian  sums.  A  set  of  IV  x  Af  (NAf)  estimators  is  needed  to  provide  all  the 
individual  xj.^+i  estimates.  The  overall  estimate  from  (3.3.10)  and  (3.3.11), 


N  M  ,, 

^fc+1  =  E  E®*+iP(ap^  I  ^fc+i) 

i=l  >=1 


(3.3.12) 
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is  a  weighted  sum  of  the  NM  individual  estimates.  The  weighting  factor 
is  the  probability  that  a  =  and  b  =  bj  given  the  current 
measurement  sequence. 

Each  estimate  (3.3.11)  is  found  from  a  modified  linear  state  estimator. 
Rewriting  the  measurement  sequence  as  the  conditional  density 

function  of  (3.3.11)  is 


P(x»+.  I  Zi. a,, 4,)  - 


(3.3.13) 


Expanding  as  before  gives 


/  t  17  L  \  P(^fc+l  I  I  /001,^^ 


The  first  term  of  the  numerator  of  (3.3.14)  is  Gaussian  since  a  =  a„  and 
b  =  bj  are  given.  The  conditional  mean  value  is  found  by  first  combining  (3.3.3) 
and  (3.3.4)  at  time  iteration  A:+l 


^fc+i  —  +  ^fc+i  +  ”fc+i 


(3.3.15) 


and  then  taking  the  expected  value 

'^le+l  ~  ^[^k+1  I  ~  ^^k+1  "I"  "I"  •^[’^fc+1  I  ^k+li^if^jf^k] 

Since  njt+i  has  a  mean  value  of  zero,  the  expected  value  is 


mi»+i  =  Hxk+i  +  bj 


(3.3.16) 
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The  conditional  covariance  is  found  next  as 


Q+i  =  E[{zk+i  -  ^k+i){zk+i-mi\iy  I  Xk+i,ai,bj,Zk] 

=  E[{Hxk+i  +  h+i  +  Uk+i  -  Hxk+i  -  bj){zt+i  -  |  ifc+i,a.,fe_,,Zfc] 

Knowing  that  b  =  bj 

—  -^[^ifc+l^fe+l  I  ^k+l^^ii^ji^k]  —  ^  (3.3.17) 

The  conditional  density  is  therefore 

Pi^k+i  I  Xk+i,ai,bj,Zt)  =  N[m‘4i,i2]  =  NlfTr^+i  +  bj,R]  (3.3.18) 

The  second  term  of  the  numerator  of  (3.3.14)  is  also  Gaussian  since  a  =  a,  and 
b  =  bj  are  given.  The  conditional  mean  value  is  found  by  first  combining  (3.3.1) 
and  (3.3.2) 

+  Tu^t  + (3.3.19) 


and  then  taking  the  expected  value 

^fe+i  =  ^^k+i  I  =  $£[1*,  1  ai,6^,Z*]  +  Tu*  +  ’^o,  +  <i!E[mk  1  a„6_,,Zfc] 

Since  m*  has  a  mean  value  of  zero,  the  expected  value  is 

21^+1  =  +  Tufc  +  ’Ja,  (3.3.20) 
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where  is  the  ■prediction  at  time  iteration  /j+l  given  prior  measurements  up  to 
only  iteration  fc,  while  x'^  is  the  previous  estimate  at  iteration  k  given 
measurements  up  to  iteration  k. 

The  conditional  covariance  is  found  next  as 

I 

=  +  Tufc  +  'J'Ofc  +  -  Tufc  -  'J’a,)(Xfc+i  -  |  a„  bj,  Z^] 


Since  a  =  a,  and  recalling  that  and  m^t  are  statistically  independent 


=  ^E[{xk  -  xi^)ixk  -  I  |  ai,bj,Zk]^'^ 

Letting  P)/  =  E[{xif  —  xj/)(xfc  —  I  aj,6^-,Z/t]  be  the  conditional  covariance  of  the 
error  term  x^  —  x'J, 

Mi/+i  =  (3.3.21) 

Since  the  covariances  Q  and  R  are  the  same  for  each  ij*^  estimator,  the  resp>ective 
conditional  covariances  simplify  to  P)/  =  P*  and  M)/^i  =  eind  the  conditional 

density  is  therefore 

I  ai,6„Z,)  =  (3.3,22) 

The  leaist-meau-square  (LMS)  estimator  is  foimd  by  taking  the  derivative  of 
the  natural  logarithm  of  (3.3.14)  with  respect  to  x^+i,  setting  the  result  equal  to 
zero,  and  determining  the  value  of  x^^j  =  that  produces  this  result.  Since  the 
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denominator  term  of  (3.3.14)  does  not  depend  on  Xt+i,  it  will  not  play  a  role  in 
determining  xJ4i-  Now,  set 

A  [In  p(Xfc+i  I  Zfc,  Zk+i,  a„  6,)]  =  0  (3.3.23) 

Using  (3.3.18)  and  (3.3.22)  in  (3.3.14),  (3.3.23)  becomes 

i  (In  N|mt<„,K|  +  b  M»+,]  -  0)  =  0 

^^k+1 

“  ^k\iyR~\2k+i  -  ml-Vi)  -5(xfc+i  -  2t^+ir-Wjii(Xfc+,  -  xiVi)]  =  0 

(3.3.24) 

Combining  (3.3.16)  and  using  a  rule  of  matrix  differentiation  (if  C  =  C^) 

^Ax  +  byC{Ax  +  b)  =  2A^C{Ax  +  b)  (3.3.25) 

the  first  term  of  (3.3.24)  is  arranged  in  the  form  of  (3.3.25) 

+  ^k+l  -  ^jyR~\-HXk+i  +  2fc+i  -  bj)] 
with  A  =  —H,  b  =  —  bj,  and  C  =  R~^.  Applying  (3.3.25)  results  in 

-H-^R-^Hx,^,  -  +  b,)  (3.3.26) 

Similarly,  the  second  term  of  the  derivative  (3.2.24)  is 

(3.3.27) 
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Combining  (3.3.20),  (3.3.26),  and  (3.3.27)  and  setting  x^+i  =  gives 

-  zt+i  +  bj)  -  -  ^xi^  -  Tu^  -  ^a.)  =  0 

or 

(A/ri,  +  -  ij)  +  MfJ +  rut  +  *0(13.3.28) 

Now,  define 


Pkh  =  (A^kh  +  (3.3.29) 

Using  (3.3.29)  and  solving  for  in  (3.3.28)  gives 

-  h)  +  +  ru,  +  »a()  (3.3.30) 

To  rewrite  -Pk+iAfkln  first  premultiply  (3.3.29)  by  Pfc+i,  giving  the  identity  matrix 

/  =  Pk^iiMj^h  +  (3.3.31) 

then  solve  (3.3.31)  for  Pic+iAf^li  and  substitute  into  (3.3.30) 

=  Pfc+i^'^i2-‘(^fc+i  -  bj)  +  (/  -  +  r«*  +  ^a,)  (3.3.32) 

Now,  let 


J^k+i  =  Pk^iAf^R-^ 


(3.3.33) 
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be  the  Kalman  gain,  and  rearrange  using  (3.3.29)  to  produce 

=  +  H-^ 

or 

^fc+i  =  +  R)-^  (3.3.34) 

Substituting  (3.3.33)  into  (3.3.32)  and  rearrainging  produces  the  familiar  Kalman 
filter  equation  below,  with  modifications  to  allow  for  the  two  bias  terms  of  the 
Gaussian  sum  densities 


®fc+i  =  +  Tufc  +  +  Kk+i[zk+i  -  bj  -  +  Tujk  +  ’Joi)]  (3.3.35) 


As  a  final  step,  substitute  (3.3.33)  into  (3.3.31)  to  produce 


or 

(3.3.36) 


With  the  individual  ij*^  estimate  (3.3.35)  determined,  the  next  step  in 
producing  the  overall  estimate  (3.3.12)  is  to  find  the  weighting  term  p(a„5j  1  ^t+i). 
Using  Zic+i  =  again,  the  weighting  term  becomes 


Piovbj\Z^+i)  = 


p{^k+i,avbj,Zi,) 
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Using  Bayes’  theorem  produces 


p(ai,6jlZfc+,)  = 


P(^t+i  I  ai,f>yZt)p{ai,bj  I  Zfc) 

P(2fc+i  I  Zk) 


(3.3.37) 


The  first  term  of  the  numerator  can  be  modeled  as  a  Gaussian  density  if  the 
bias  terms  switch  slowly  compared  to  the  time  interval  k.  This  assumption  will  be 
made  here  and  has  been  validated  by  extensive  simulation  and  analysis  by  Moose 
and  Wang  [24].  Combining  (3.3.15)  and  (3.3.19),  2^+1  is  written  as 


^fe+i  —  +  Fujt  +  +  ^mjt)  +  (3.3.38) 


Taking  the  expected  value,  the  conditional  mean  is 
^fc+i  =  E[zk+i  I  a.,6_,,Zfc] 

=  mE[Xk  I  a.,  bj,  Zfc]  +  HTut  +  +  6, 

or 

=  Hi^xi^  +  r«fc  +  ^a,)  +  bj  (3.3.39) 

The  conditional  covariance  is  found  next  as 

«i'+i  =  -  ni.r  I  ‘■..I’y.  ZJ  (3-3.40) 

Using  (3.3.38)  and  (3.3.39),  the  z^+i  term  is  written  as 


Zk+i  -  2fc-Vi  =  +  Tufc  +  $0*  +  ’Jmfc)  +  6fc+i  +  rifc+i  -  H{^xi^  +  Tu^  +  -  bj 

(3.3.41) 
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Inserting  (3.3.41)  into  (3.3.40)  gives 


=  ■E[(-ff{^(a;fc-2fc'')  +  ’*'”*jk}  +  "fc+i)(2fc+i  -2t+i)’^  I  ai,bj,Zk] 

and  using  (3.3.17)  and  (3.3.21)  produces 

iZjJ.,  =  +  R  (3.3.42) 

Since  the  covariances  R  and  M)^^i  =  Mf.^i  are  the  same  for  each  estimator,  the 
conditional  covariance  simpUfies  to  ~  -^fc+n  conditional  density  is 

therefore 


P(^it+i  1  =  N(2l:-'+i,iZit+il 

=  N[^($ii-'  +  +  6,,  HMk+iH'^  +  /?]  (3.3.43) 

The  second  term  of  the  numerator  of  (3.3.37)  can  be  rewritten  to  explicitly 
include  the  time  interval  for  the  two  bias  terms 


P(««  bj\Zk)  =  P{afc+i  =  h+{  =  h  I  Zk)  (3.3.44) 


Using  Bayes’  theorem  md  the  definition  of  marginal  densities,  (3.3.44)  is  expanded 
into 


PMj\Zk) 


N  M 

5Zp(Ofc+l  =  OnO*  =  ®o»^it+l  =  =  b^,Z^) 

a=l  /3=1 

P{Z,) 


N  M 

=  X]  p(®fc+l  =  **•’  ^k+l  -  I  ^k  =  h  =  V  ^fc)p(“fc  =  ^k  =  Zk)  (3.3.45) 

£»=1  0=1 
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Recalling  that  a;(.+i  ^k+i  assumed  statistically  independent,  and  given  that 
Zfc,  a*  =  a^,  and  6^  =  known,  (3.3.45)  becomes 

N  M 

=  X]  p(°fc+i  =  I  =  Oa)pibk+i  =  I  f^k  =  Mp(“fc  =  =  ^/3  I  ^fc)  (3.3.46) 

a=l 0=1 

The  measurement  sequence  is  not  needed  in  the  first  two  terms  of  (3.3.46)  since 
fljt  =  Oq  and  fefc  =  6^  are  known  at  time  interval  k  and  are  independent  of  Z^.  The 
following  notation  will  be  used  to  express  the  three  terms  of  the  summation 
(3.3.46) 


C  =  Pi^k+l  =  a.  1  Ofc  =  Oa) 

(3.3.47) 

H^  =  P{bk+t  =  bj\b,  =  b^) 

(3.3.48) 

=  p{<ik  =  aa,bk  =  b0\Zk) 

(3.3.49) 

The  terms  (3.3.47)  and  (3.3.48)  are  Maikov  transition  probabilities  [22];  that 
is,  9'°  is  the  conditional  probability  that  a  =  a,  at  time  interv6j  fc+l,  given  that 
a  =  at  time  interval  k.  The  91^  term  is  similarly  defined.  The  term  (3.3.49)  is 
of  the  same  form  as  (3.3.37)  and  is  just  the  previous  weighting  term  at  the  previous 
time  interval  k. 

The  denominator  term  of  (3.3.37)  is  independent  of  ij.  Therefore  it  is  the 
same  for  each  estimator  and  becomes  a  scale  factor. 

Combining  (3.3.43)  and  (3.3.47)  -  (3.3.49),  the  weighting  term  (3.3.37)  is 
written  as 


•O&i  =  1 =  CJ*,N(n4.,R.+.lE  (3.3.50) 

Q=1 0=\ 
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where  CJ+i  is  a  scale  factor  determined  at  each  time  interval  such  that 

N  M 

E  =  1  (3-3.51) 

«=1  j=l 

guaxanteeing  that  the  sum  of  all  the  weighting  terms  (3.3.50)  is  equal  to  one.  The 
mean  and  covariance  of  the  Gaussian  density  function  axe  available  from  the 
Kalman  filter  equation  (3.3.35)  and  Kalman  gain  (3.3.34). 

The  equations  for  the  modified  Gaussian  sum  adaptive  filter  for  the  system  of 
(3.3.1)  -  (3.3.4)  axe  summarized  in  Table  2.  The  structure  of  the  overall  adaptive 
filter  is  a  bank  of  Kalman  filters  operating  in  parallel,  with  each  individual 
estimate  multiplied  by  its  own  corresponding  weighting  term.  The  estimator 
based  on  the  bias  terms  that  most  closely  matches  the  actual  bias  terms  of  the 
modeled  system  will  have  a  corresponding  weighting  term  that  tends  closer  to  one, 
while  the  weights  of  the  other  mismatched  estimators  will  tend  towards  zero.  A 
block  diagram  of  the  adaptive  filter  is  shown  in  Figure  2. 

A  simulation  example  is  presented  in  the  following  section. 
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Table  2. 


Modified  Gaussian  sum  adaptive  filter  equations. 

System:  Xt+i  =  +  Tu*.  +  'i'lWfc 

Wk  =  at  +  "ifc 
Zk  =  Hxk  +  Vk 
Vk  =  bk  +  Tlk 

N  M 

Overall  estimate:  it+i  =  ^  ^Xk\iP{ai,bj  \  Zk+i) 

i=i  i=i 

Kalman  filter  equation: 

*i'+i  =  +  Ttii  +  tdj  +  -bj- +  Tuj  +  taj] 

Kalman  gain  equations:  A/*+i  = 


Weighting  term: 

N  M  .  „ 

“'i'+i = p(o,,iy  I  Z.+0 = cj^,p(2,„  I  Oi,iy,Zi)E 

0=1  /3=1 

with  p(^fc+i  I  a„6j,  Zk)  =  ^H{^x'k^  +  Tuk  +  ^a.)  +  bj,HMk+iH'^  +  i2] 

C  =  ?(«*+!  =  a.  I  Ofc  =  «o) 

H^  =  P{bk+i=h\h  =  b0) 

^f  =  Piak  =  a^A  =  b0\Zk) 


N  M 

and  scale  factor  Cl^i  such  that  ^  ^  ^k\i  =  1 

i=i  j=i 
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Figure  2.  Modified  Gaussian  sum  adaptive  filter  structure. 
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3.4  Simulation  Example 


An  example  illustrating  the  modified  Gaussian  sum  estimation  technique  is 
now  presented.  A  first-order  system  is  used,  modeled  by  the  following  discrete¬ 
time  equations 


plant: 

(1  — e~“^l 

=  e  -t-  (1  e  “^)u;t  +  a 

(3.4.1) 

Wk  =  ak  +  TUk 

(3.4.2) 

measurement: 

Zk  =  Xk  +  Ufc 

(3.4.3) 

Vk  =  h  +  nk 

(3.4.4) 

The  value  of  a  is  0.6  and  the  sample  time  T  =  1  second.  For  simplicity,  let 
the  plant  noise  (3.4.2)  be  zero  mean  white  Gaussian  with  variance  Q  =  1.0. 
Therefore,  the  randomly-switching  plant  bias  term  a*.  =  0.  The  actual 
measurement  noise,  modeled  by  (3.4.4),  has  the  Gamma  density  of  (3.2.12),  with  a 
mean  and  variance  of  4.  The  measurement  bias  term  can  be  randomly  selected 
from  the  last  three  biais  terms  of  Table  1,  {2.553,  4.555,  6.933}.  The  first  bias  term 
of  Table  1,  {2.537},  is  not  used  since  it  is  so  close  in  value  to  the  second  bias  term. 
The  measurement  model  noise  term  is  zero  mean  white  Gaussian  with  variance 
R  =  1.0.  A  deterministic  input  of  u*.  =  10  is  used  throughout  the  simulation. 
Figure  3a  shows  the  measurement  and  state  sequences.  Note  how  the  measurement 
is  centered  about  14,  indicating  a  mean  value  for  the  Gamma  density  of  4. 

The  filter  is  initialized  with  equally-valued  weighting  terms 

wi  =  j,  for  j  =1,2,3  (3.4.5) 
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A  Maxkov  transition  probability  matrix  0^^,  consisting  of  elements,  is 
configured  with  a  high  probability  that  the  bias  term  does  not  switch  from  one 
value  to  another,  and  a  low  probability  that  the  bias  term  does  switch,  such  as 


ei'>  = 


.95 

.025 

.025 

.025 

.95 

.025 

.025 

.025 

.95 

(3.4.6) 


The  initial  value  of  the  state  and  state  estimate  is  Xq  =  Xq  =  20,  and  the  initial 
value  of  the  variance  of  the  error  ~  ^k+i  is  =  100.  The  overall  state 
estimate  and  the  state  are  shown  in  Figure  3b,  with  the  error  and  overall  state 
estimate  shown  in  Figure  3c.  Note  how  the  error  appears  to  be  zero  mean,  thus 
showing  that  the  MGS  adaptive  filter  removes  the  bias  effect  of  the  nonzero  mean 
Gamma  measurement  noise. 

Figures  4a,  b,  and  c  show  the  weighting  terms  for  each  of  the  6^  bias  terms.  In 
order  to  lessen  the  noise  of  the  weighting  terms,  a  first-order  lowpass  filter 


wi^.,  =  Au>J  +  (1  -  A)uij+i 


(3.4.7) 


is  used  to  smooth  the  weighting  terms,  where  A  =  0.7.  Figure  4d  can  be  thought  of 
as  the  overall  measurement  bias  estimate  6;^  due  to  the  nonzero  mean  Gamma 
measurement  noise.  Using  (3.3.35)  and  (3.3.50),  this  overall  bias  estimate  is  part  of 
the  overall  state  estimate  (3.3.12)  and  is  written  in  this  case  with  =  0  as 

M 

(3.4.8) 

i=i 

Note  how  this  overall  bias  estimate  approximately  models  the  mean  value  of  4  of 
the  Gamma  measurement  noise. 
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Iteration  k 

(b) 


Iteration  k 

(c) 


Figure  3.  (a)  Measurement  and  state  simulation,  (b)  modified  Gaussian  sum 
estimate  compared  with  state,  (c)  error  =  state  —  estimate. 


NONGAUSSIAN  ESTIMATION 


43 


3.5  Comparison  of  Gaussian  Sum  and  Modified  Gaussian 
Sum  Algorithms 

The  Gaussian  sum  (GS)  algorithm  of  Alspach  [6],  [7]  and  the  modified 
Gaussian  sum  (MGS)  algorithm  developed  in  this  present  work  exhibit  several 
similarities.  Among  them  are  using  a  Gaussian  sum  density  approximation  to 
model  the  actual  nonGaussian  densities,  a  structure  formed  by  a  bank  of  Kalman 
filters  operating  in  parallel,  and  weighting  terms  that  are  nonlinear  functions  of  the 
measurement  data.  However,  there  are  several  key  differences  between  the  two 
algorithms  that  set  them  apart. 

One  difference  is  that  the  GS  algorithm  produces  an  exponentially  increasing 
number  of  Gaussian  terms  at  each  iteration,  while  the  MGS  algorithm  produces 
only  a  fixed  number  of  Gaussian  terms.  As  a  result,  the  GS  algorithm  requires  an 
exponentially  increasing  number  of  individual  Kalman  filters  operating  in  parallel, 
which  is  impossible  to  implement  for  any  practical  purpose.  The  MGS  algorithm 
avoids  the  increasing  computational  and  storage  requirements  of  the  GS  edgorithm. 

This  leads  to  a  second  difference  between  the  two.  The  GS  algorithm  is 
optimal  while  the  MGS  algorithm  is  suboptimal.  Tugnait  [55]  and  Raisch  [56] 
point  out  that  in  order  for  the  state  estimate 

N  M 

Xk+i  =  £[ifc+,  I  Zk+i]  =  £  Sifc-Vi  I  ^ifc+i)  (3-5-1) 

.=1  >=i 

to  be  optimal,  the  full  exponentially  increaising  number  of  individual  Kalman  filters 
is  required.  This  leads,  in  turn,  to  the  condition  that  the  individual  estimate 

®i+i  =  a:fc+,p(ifc+i  1  Zk+i,ai,bj)  (3.5.2) 
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is  optimal  only  if  the  density  p(a:it+i  |  is  conditioned  on  the  full 

measurement  sequence  ^jt+i  =  switching-parameter 

bias  term  sequences  =  {aj,a2,...,ajt+i}*  -^£+1  =  {^n^2i- •  of  present 

and  past  bias  values  for  this  particular  estimate.  Therefore,  (3.5.2)  would  be 
written  as 


~  ^*:+lP(^/fc+l  I  ^^fc+l  (3.5.3) 

J  —  OO 

and  the  overall  state  estimate  (3.5.1)  would  be 

^it+i  ~  ^  ^fc^+i  P(-^l:+n-®i+i  I  ■^fc+i)  (3.5.4) 

i=i  i=i 

The  weighting  term  \  Z^+i)  is  also  now  a  function  of  the  full  bias  term 

sequences.  The  upper  limits  on  the  summations  are  no  longer  fixed  constants,  but 
are  now  functions  of  the  time  iteration.  At  each  iteration,  a  is  randomly  selected 
from  a  fixed  set  of  N  discrete  values.  This  produces  possible  Aj^+i  sequences 

at  iteration  fc-l-1,  with  i  running  from  Similarly,  h  is  randomly 

selected  from  a  fixed  set  of  M  discrete  values,  producing  possible  B](+i 

sequences,  with  j  rimning  from  1,2,...,M*''*'^. 

Rewriting  the  measurement  sequence  eis  Z^^.,  =  {^k+nZ^},  and  the  bi2is  term 
sequences  as  =  {0^+1,^^}  and  the  density  of  (3.5.3)  is 

expanded  as 


P{xk+i  I  ^k^uZk,ai+i,Ai,bi+i,Bi)  = 


AT  AT^lVaAr^ 


(3.5.5) 


where 
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iVj  =  p(2/t+i  1  ^k+li<^k+U  Bi,  Z If) 

N2  =  P(afc+l  I  ^k+U  -^kiH+li  Bit  Z I.) 

^3  —  P{^i+l  I  ®t+H  ^fc)  ^i) 

^4  =  PiXk+l\^k,Bi-,Zk) 


B\  —  p(^/l:+l  I  ®fc+l5-^fc>^i+l»-®i?^fc) 

D2  =  p(afc+i  1  -^kiH+ii  Bi,  Z  ic) 

B3  =  p(fci+i  I  Ak-,Bi,Zk) 


The  iVj  term  can  be  reduced  to  the  Markov  transition  probability  N2  =  pi^k+i  \  o,].) 
since  the  is  a  semi-Markov  sequence.  The  N3,  D2,  and  can  be  reduced  in 
a  similar  fashion,  producing 

■^2  =  pK+1  I  4),  ^3  =  P{H+i\H) 

B2  =  P(4+1 1 4),  B3  =  p(6i+i  I  bi) 

Since  iVj  =  -^2  ^3  =  cancel  out  in  (3.5.5)  leaving 

p(®fc+i  I  ^ k+ii ^ k't^^k+ii ■^kiH+i^ Bi)  = 

P(^t+i  I  Xk+iA+vAlbi+i^BiZt)p{xt+i  I  ai,Ai_^,blBi.^,Zk)  .  . 

P(^fc+l  I  *^k+V  ^kiH+\^  Bi,  Z  f.) 

The  density  function  of  (3.5.2)  used  in  the  MGS  algorithm  is  an  approximation  to 
(3.5.6),  since  it  eliminates  the  past  bias  term  sequences  in  (3.5.6).  Using  a  simil2U^ 
development,  the  weighting  term  of  (3.5.1)  used  in  the  MSG  algorithm  is  an 
approximation  to  the  weighting  term  of  (3.5.4),  since  the  past  bias  term  sequences 
are  eliminated  here  also.  The  MGS  algorithm  uses  only  the  information  contained 


NONGAUSSIAN  ESTIMATION 


46 


in  the  present  bias  values,  thereby  leading  to  a  suboptimal  state  estimate  (3.5.1). 

Even  though  the  GS  algorithm  produces  an  optimal  state  estimate,  it  cam 
never  be  found  in  practice  since  the  number  of  GS  terms  increases  exponentially 
from  one  iteration  to  the  next  according  to  (3.1.15)  and  (3.1.22),  combined  below 

=  QkrkCk  (3.5.7) 

In  order  to  reduce  this  growing  number  to  some  prescribed  fixed  number  E’  at  each 
iteration,  Alspach  [6]  proposes  a  suboptimal  estimate  based  on  observed 
mechanisms  that  allow  terms  to  be  neglected  or  combined.  At  iteration  k,  he  drops 
weighting  terms  (3.1.16)  and  (3.1.23)  that  fall  below  some  prescribed  threshold.  He 
also  combines  several  GS  terms  of  (3.1.11)  into  one  term  if  their  means  (3.1.12)  and 
variances  (3.1.13)  have  become  approximately  equal.  These  two  operations  are 
used  until  only  =  E'  significant  terms  remain.  Then,  at  iteration  ^+1,  the 
number  of  GS  terms  grows  again  to  the  larger  number  according  to  (3.5.7),  is 
reduced  by  eliminating  and  combining  terms  until  ^Jid  the  cycle  repeats 

for  all  subsequent  iterations.  This  method  results  in  a  substantial  reduction  in  the 
number  of  GS  terms  with  a  neghgible  effect  on  the  p(xfc^i  I  ^fc+i)  density 
approximation. 

On  the  surface,  it  appears  that  Alspach’s  suboptimal  Gaussian  sum  (SGS) 
algorithm  can  achieve  performance  equal  to  that  of  the  MGS  algorithm,  since  only 
a  fixed  number  of  terms  is  used  to  produce  the  state  estimate.  However,  in  order 
for  the  SGS  algorithm  to  produce  the  fixed  number  of  terms  E',  the  larger  number 
of  terms  q^ricE'  must  first  be  generated,  requiring  gtr^E'  parallel  filters.  The  MGS 
algorithm  requires  only  E'  filters  at  each  iteration  to  produce  a  E'  term  state 
estimate.  The  computational  savings  of  the  MGS  algorithm  is  considerable.  Even 
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if  the  plant  and  measurement  noise  processes  (3.1.5)  and  (3.1.6)  were  approximated 
using  the  minimum  number  of  two  terms  each  {qi,,  =  r^  =  2),  the  SGS  algorithm 
would  require  a  number  of  filters  =  4  times  greater  than  the  MGS  algorithm. 

A  third  difference  is  that  the  GS  algorithm  must  always  use  time-varying 
Kalman  gains  while  the  MGS  algorithm  allows  the  use  of  steady-state  Kalman 
gains.  The  growing  memory  problem  of  the  GS  algorithm  prevents  steady-state 
Kalman  gains  to  be  calculated  off-line  before  running  the  state  estimator.  The  SGS 
algorithm  must  also  use  time-varying  gains  because  the  elimination  and 
combination  of  terms  is  performed  at  each  iteration.  Therefore,  the  Kalman  gains 
could  never  be  computed  in  advance.  The  MGS  algorithm  uses  the  same  fixed 
number  of  filters  at  each  iteration.  With  constant  covariances  Q  and  R  of  the 
plant  noise  (3.3.2)  and  measurement  noise  (3.3.4)  used  in  the  gain  equations  for 
each  filter,  steady-state  Kalman  gaiins  could  be  calculated  off-line.  Even  if  Q 
and  R  were  different  for  each  filter,  they  would  then  be  modeled  as  semi- 
Markov  processes,  with  values  randomly  selected  from  fixed  sets  of  discrete  values. 
Since  all  the  covariance  values  would  be  known  in  advance,  steady-state  Kalman 
gains  could  again  be  calculated  off-line. 

A  fourth  difference  between  the  two  algorithms  is  the  use  of  Markov  tremsition 
probabilities  in  the  weighting  term  of  the  MGS  algorithm,  restated  here  for 
convenience 


0=1  /J=l 


The  weighting  term  of  the  GS  algorithm  at  iteration  fc-fl  is 


Olk+l 


Ok+lpk+lJ^k+lj 
2  (^k+ipk+iJ^k+ij 


(3.5.8) 


(3.5.9) 
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Define  the  scale  factor  as 


K+i  =  ^ -  (3.5.10) 

51  O'fc+i-Tfc+i^^fc+i 

i=l  •' 

Using  the  definition  of  the  term 

awi,-  =  “»A„  (3-5.11) 


and  (3.5.10)  in  (3.5.9)  results  in 

^k+lj  =  (3.5.12) 

The  GS  weighting  term  (3.5.12)  now  has  a  similar  recursive  form  as  the  MGS 
weighting  term  (3.5.8),  with  and  C%^i  being  the  respective  scale  factors,  ^md 
Nfzfc+n  ^fc+i]  being  the  respective  Gaussian  density  fimctions  based  on 
the  measurement  data.  The  main  difference  between  the  two  weighting  terms  is 
the  Markov  transition  probabilities  in  (3.5.8)  and  the  'ik+i^Pkn  f^ictor  in  (3.5.12). 
The  Markov  probabilities  govern  the  chances  that’ a  bias  term  is  going  to  switch 
from  one  value  to  another.  The  'ik+\^Pk„  factor  is  formed  from  the  weighting 
terms  of  the  Gaussian  sum  approximations  of  the  nonGaussian  densities  (3.1.5)  and 
(3.1.6),  and  has  no  meaning  simileu:  to  the  M£Lrkov  probabilities.  Of  course,  the 
current  number  of  Qk+i^  terms  is  larger  than  the  previous  number  of  terms  at 
the  each  iteration,  while  the  number  of  wjt'Vi  terms  remains  fixed  at  each  iteration. 
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4.0  APPLICATION  TO  A  MODAL  SYSTEM 


The  modified  Gaussian  sum  (MGS)  adaptive  filtering  technique  of  chapter  3  is 
now  applied  to  a  modal  system  problem.  Large,  heavily  damped  structures 
modeled  by  a  plant  with  complex  modes  are  considered  in  [1],  The  input  to  the 
plant  is  unknown,  but  its  frequency  characteristics  are  known  and  the  input  is 
always  present.  The  nonGaussian  nature  of  the  input  signal  is  modeled  using  the 
signal  model  of  chapter  3,  and  the  MGS  adaptive  filter  for  this  system  is  developed. 
A  simulation  example  using  the  MGS  adaptive  filter  is  given  and  the  results  are 
compared  to  those  produced  from  an  augmented  Kaimm  filter  bcised  on  a  system 
model  assuming  a  narrowband  Gaussi£tn  input  signal.  A  necessary  condition  for 
effective  estimation  is  derived.  Alternate  estimation  procedures  eire  developed  to 
compensate  for  situations  when  this  condition  is  not  met.  Several  configurations 
are  simulated  and  their  performance  results  are  analyzed  and  compared. 


4.1  Modal  System  and  Filter  Development 

The  MGS  algorithm  is  applied  to  a  modal  system  with  a  nonGaussian  input 
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signal  and  Gaussian  plant  and  measurement  noise.  The  theoretical  foimdation  for 
this  system,  which  serves  as  a  model  for  large,  heavily  damped  structures,  is  found 
in  [1]  and  briefly  outlined  in  appendix  A.  Using  a  zero-order-hold  model,  the 
resulting  discrete-time  system  equations  with  added  noise  terms  are 


-f  rufc  + 

(4.1.1) 

Zk  =  Hxk  +  DiXfc  -1-  Ufc 

(4.1.2) 

‘^k  —  K  +  "Jk 

(4.1.3) 

where  is  the  state  vector 

is  the  measurement  vector 

W/c  is  a  zero  mean  white  Gaussian  plant  noise  process  with 
covariance  Q 

Vf.  is  a  zero  mean  white  Gaussian  measurement  noise  process 
with  covariance  R 

$,  r,  H,  D  are  the  respective  constant  transition  matrices 

Ufc  is  the  vector  Gaussian  sum  (GS)  signal  model  of  the  actual 
nonGaussian  input  signal,  comprised  of  semi-Markov  bias 
vector  zero  mean  white  Gaussian  noise  with 

semi- Markov  covariance  5jt 

and  the  random  quantities  x^,  *ire  eissumed  to  be  mutually 

statistically  independent. 

There  axe  two  importcint  differences  between  the  modal  system  of  (4.1.1)  - 
(4.1.3)  and  the  general  system  of  chapter  3,  (3.3.1)  -  (3.3.4).  The  first  difference  is 
that  in  the  modal  system,  the  signal  i^  ^  input  not  only  to  the  plant,  but  is  adso 
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fed  through  to  the  measurement  Zf..  Inserting  (4.1.3)  into  (4.1.1)  and  (4.1.2) 
produces  modified  plant  and  measurement  equations 


where 


Xk+1  =  ^^k  +  rftfc  +  ’^w'k 

(4.1.4) 

Zk  =  Hxt  +  Dbk  +  v'k 

(4.1.5) 

w'k  =  Tnjt  + 

(4.1.6) 

v'k  =  Drik  +  ujt 

(4.1.7) 

Because  n*.  appears  in  both  (4.1.6)  and  (4.1.7),  injj.  and  v'h.  are  correlated  Gaussian 
random  processes.  At  first  glance,  this  correlation  may  seem  to  be  an  obstacle  in 
developing  the  Kalman  filter  equations  for  this  model,  since  the  Kalman  filter 
development  requires  that  the  plant  and  measurement  noise  processes  be 
uncorrelated.  However,  what  is  really  necessary  in  this  situation  is  for  Wk_i  and  t;Jt 
to  be  uncorrelated  [57].  Rewriting  (4.1.4)  for  gives 

=  +  +  (4.1.8) 

and  substituting  (4.1.8)  into  (4.1.5)  gives 


zt  =  +  r6jfc_i  +  ^tnjt-i)  +  Dbk  +  v'^  (4.1.9) 

thereby  showing  that  any  covariance  calculations  involving  (4.1.9)  will  involve  the 
correlation  between  and  vj^,  and  not  between  u;^  and  The  correlation 

between  amd  ujt  is  given  by 
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E[Wk.iv'i^]  =  i?[(rnfc_i  +  +  v^Y] 

=  TE[n,_,nl]D-^  +  +  <i!E[w,_,nl]D^  +  ^E[w,_,vl]  (4.1.10) 

=  0 

since  w^.,  Ujt,  and  axe  assumed  to  be  mutually  statistically  independent. 

The  second  difference  is  that  the  covariance  5^  of  the  GS  signal  model  is  no 
longer  constant,  but  can  vary  with  time.  The  bias  vector  b^.  and  covariance  axe 
both  semi-Markov  processes  with  state  transitions  governed  by  a  single  Markov 
transition  probability  matrix.  That  is,  values  of  6^  and  5^  will  be  paired  together 
and  will  randomly  switch  from  one  pair  to  another.  The  switching  covariance  can 
be  thought  of  as  changing  the  power  of  the  Gaussian  process. 

As  in  chapter  3,  the  optimal  estimate  of  the  state  vector  is  found  by 

minimizing  the  mean-square  error  between  and  This  results  in  the 

conditional  mean  estimate 

,oo 

Xk+i  =  E[xk+i  I  =  Xfc+ip(xfc+i  I  Z^+i)  dxk+i  (4.1.11) 

J  —  OO 

where  is  the  current  measurement  sequence  {zi,Z2i--->^k+i}-  The  conditional 
density  function  of  (4.1.11)  can  be  written  as  the  ratio  of  the  corresponding  joint 
and  maxginal  densities 


I  Zn-.) = 

The  bias  vector  6*  and  covariance  matrix  5*.  axe  explicitly  brought  into  (4.1.12)  by 
considering  the  joint  density  p{xk+i,Zif^i)  to  be  a  maxginal  density  found  from 
p{xk+vZii^i,bk  =  6„5fc  =  Si)  by  summing  over  the  5^  terms 
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(4.1.13) 


P(a:n.i  I  Zt+i)  = 


M 

53  P(^k+U  ^k+H^k 

t=l 


=  6,,Sfc  =  S.) 


Pi^k+l) 


After  some  additional  algebraic  manipulation,  the  optimal  estimate  of  (4.1.11) 
becomes 


M 

^k+i  =  I  ^fc+i)  (4.1.14) 

i=l 

where  is  the  conditional  mean  estimate  of  given  that  =  b,  and  5^  =  5„ 
denoted  by 

f°° 

4+x  =  Xk+ip{xk+i  I  Zt+1,6,,5.)  dijt+i  (4.1.15) 

J  —  00 

and  the  weighting  factor  p(6„5, 1 2’jfc^.i)  is  the  probability  that  6*  =  6,  and  5*  =  5, 
given  the  current  me<isurement  sequence.  The  overall  estimate  of  (4.1.14)  is  then  a 
weighted  sum  of  M  individual  estimates,  each  based  on  a  particular  pair  of 
parameters  b^  and  5^. 

Each  individual  estimate  (4.1.15)  is  found  by  a  modified  Kalman  filter.  Using 
the  same  method  in  chapter  3,  the  Kalman  filter  equation  for  (4.1.15)  is 

4+1  =  ^4  +  r6<  +  Ki^i[zk+i  -  Dbi  -  H{^xi  +  r6.)]  (4.1.16) 

with  covariance  and  gain  equations 

M’fc+i  =  +  TSiV^  +  (4.1.17) 

+  DStD^  +  R)-'  (4.1.18) 
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(4.1.19) 


PUi  =  {I-KWiH)MUi 

Note  that  (4.1.17)  -  (4.1.19)  are  now  dependent  on  5,  so  that  an  individual  set  of 
these  equations  has  to  be  computed  for  each  estimator  (4.1.16),  whereas  in 
chapter  3  only  one  set  was  computed  and  used  for  all  individual  estimators. 

As  in  chapter  3,  the  next  step  in  producing  the  overall  estimate  (4.1.14)  is  to 
find  the  weighting  term  p(6„5i  |  .^jt+i).  Using  Bayes’  theorem,  and  writing  as 
the  weighting  term  becomes 

O  1 ..  ^  I  ii>5i.2i)p(6„5, 1  Zj) 

The  denominator  term  is  independent  of  i  and  is  a  scale  factor  that  ensures  that 
the  sum  of  the  weights  (4.1.20)  at  each  iteration  is  equal  to  one.  The  first  term  of 
the  numerator  can  be  modeled  as  a  Gaussian  density  if  the  bias  and  covariance 
terms  switch  slowly  compared  to  the  sample  interval  k.  This  is  the  same 
assumption  made  in  chapter  3.  This  conditional  density  is 

p(z*+i  1 6,,5,,Zfc)  =  +  Tb,)  +  Dbi,  HMU,H^  +  DSiD^  +  R]  (4.1.21) 

The  mean  and  covariance  of  (4.1.21)  are  readily  available  from  the  Kalman  filter 
equation  (4.1.16)  and  the  Kalman  gain  (4.1.18). 

The  second  term  of  the  numerator  can  be  rewritten  to  explicitly  include  the 
sample  time  for  the  bias  and  coveuriance  terms 

I  ^k)  —  pi^k+l  —  ^i^^k+1  —  I  ^k)  (4.1.22) 
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Using  Bayes’  rule  and  the  definition  of  marginal  densities,  (4.1.22)  is  expanded  into 


M 

I  ^k)  —  ^.p{^k+l—^ii  ^k+l—^i  I  ^k—^ji^k—^j)Pi^k—^ji^k—^j  I  ^k)  (4.1.23) 

J=1 

As  in  chapter  3,  the  following  notation  is  used  to  express  the  terms  of  (4.1.23) 

~  p{^k+l  ~  ^ii^k+l  —  ^i\^k  —  ^j^^k  —  ^  j)  (4.1.24) 

H  =  Pih  =  h^Sk  =  Sj\Zk)  (4.1.25) 

The  term  (4.1.24)  is  a  Markov  transition  probability  [22];  that  is,  is  the 
conditional  probability  that  6  =  6,  and  5  =  5,  at  time  A:  +  l,  given  that 
b  =  bj  and  5  =  5j  at  time  k.  The  term  (4.1.25)  is  of  the  same  form  as  (4.1.20)  and 
is  just  the  previous  weighting  term  at  the  previous  time  interval  k. 

Combining  (4.1.21),  (4.1.24),  and  (4.1.25),  the  weighting  term  (4.1.20)  is 
written  as 


M 

w'fc+i  =  PihSi  1  Zfc+i)  =  C2+iP(zfc+i  1 6j,5i,Zfc)2^i.W  (4.1.26) 

>=i 

where  C%^i  is  a  scale  factor  determined  at  each  time  interval  such  that  the  sum  of 
all  the  weighting  terms  (4.1.26)  is  equal  to  one. 

The  equations  for  the  modified  Gaussian  sum  adaptive  filter  for  the  system  of 
(4.1.1)  -  (4.1.3)  are  summarized  in  Table  3.  The  structure  of  the  overall  adaptive 
filter  is  again  a  bank  of  Kalman  filters  operating  in  par2dlel,  with  each  individual 
estimate  multiplied  by  its  own  corresponding  weighting  term.  The  estimator 
based  on  the  bias  and  covariance  pair  that  most  closely  matches  the  actual  bias 
and  covariance  pair  of  the  modeled  system  will  have  a  corresponding  weighting 
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term  that  tends  closer  to  one,  while  the  weights  of  the  other  mismatched 
estimators  will  tend  towards  zero.  A  block  diagram  of  the  MGS  adaptive  filter  for 
this  modal  system  is  shown  in  Figure  5. 


Table  3. 

Modified  Gaussian  sum  adaptive  filter  equations  for  a  modal  system. 

System:  +  Tui.  + 

2/t  =  Hxk  +  Duf^  +  Vfc 

Ut  =  +  Mfc 

M 

Overall  estimate:  5/t+i  =  Si  I 

i=l 

Kalman  filter  equation:  x]t+i  =  +  r6,  +  A’i+Jz/t+i  —  Dbi  —  H{^x\  +  r6,)] 

Kalman  gain  equations:  +  rS.F^  + 

KUi  =  +  DSiD^  + 

PU  =  (i-KUH)AfU 

Weighting  term:  wi+i  =  p(&„  S,  ]  Z^+j)  =  C2+ip(«fc+i  1  6.,  S.,  Z*)  e\,^sH 

>=i 

with  1 6,,  5„  Zfc)  =  N[ff($x-  +  r6,)  +  Db„  +  R] 

^bi  —  P(^k+i  —  ^ii  Sk+1  —  Si  I  bic  =  bj,  Sk  =  Sj) 

=  P{h  =  5*  =  S,  I  Zjfc) 

M 

and  scale  factor  such  that  ^  =  1 


APPUGATION  TO  A  MODAL  SYSTEM 


57 


Figure  5.  Modified  Gaussian  sum  adaptive  filter  structure  for  a  modal  system. 
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4.2  Simulation  Example 


An  example  applying  the  MGS  algorithm  to  a  modal  system  with  a 
nonGaussian  input  signal  is  now  presented.  The  results  are  compared  to  those 
produced  from  an  augmented  system  model  assiiming  that  the  input  signal  is 
Gaussian.  A  second-order  system  is  used,  modeled  by  the  transfer  function 


G(s)  = 


-1-  cs  -I-  d 
+  as +  b 


and  in  continuous-time  state  space  form  as 


x{t)  =  Ax{t)  Bu{t) 
y{t)  =  Cx{t)  -f  Du{t) 


(4.2.1) 


(4.2.2) 

(4.2.3) 


The  system  matrices  are  defined  by 


where  the  resonant  firequency  of  the  system  is  /„  =  20  Hz  aind  the  damping 
coefficient  is  =  0.1  so  that 


B  = 


0 

1 


C  =  [d-6  c-a]  ^  =  [l]  (4-2.4) 


a  =  2C(27r/J  6  =  c  =  5a  d  =  5b  (4.2.5) 

Using  a  zero-order-hold  model  with  a  sample  time  of  T  =  ^  second,  the  equivalent 
discrete-time  system  with  added  noise  terms  becomes 
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*t+i  =  **»+rtit+*u)i 


(4.2.6) 


Zic  =  Hxj^  +  Dufc  +  Ujt 


(4.2.7) 


where 


$  = 

-0.3271 

0.0057 

r  = 

0.0001 

1 

-89.2411 

-0.4691 

0.0057 

100 

(4.2.8) 

H  =  [  63165  100.53  ]  ^  =  [  1  ] 


The  plant  noise  iw^  is  zero-mean  white  Gaussian  with  covariance  Q  =  10“*°,  and  is 
uncorrelated  with  measurement  noise  which  is  also  zero-mean  white  Gaussian 
with  covariance  R  =  10“^. 

The  nonGaussian  input  signal  is  a  stochastic  FM  signal  generated  by  frequency 
modtilating  a  sinusoid  with  a  Gaussian  process.  The  form  of  the  signal  is 


u{t)  =  sm{2wfj  -I-  J  *m(r)  dr)  (4.2.9) 


where  the  amplitude  is  =  5,  the  carrier  frequency  is  /„  =  2  Hz,  the  modulation 
index  is  =  10,  and  m(t)  is  zero-mean  white  Gaussian  noise  with  variance  1. 
Figure  6a  shows  the  FM  signal  «(<)  for  1  second.  The  probability  density  function 
of  u{t)  has  the  form  [58] 


p(u) 


l«l< 
I « I  > 


(4.2.10) 


Figure  6b  shows  a  plot  of  the  density  function  of  (4.2.10),  while  Figure  6c  shows  a 
normalized  histogram  of  a  10  second  sample  of  u{t).  Note  how  closely  the 
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histogram  matches  the  actual  deusity  function. 

The  nonGaussian  input  signal  u{t)  of  (4.2.9)  will  be  modeled  in  discrete-time 
using  the  Gaussian  sum  (GS)  signal  model  described  in  section  4.1 

Uk  =  bk  +  nk  (4.2.11) 

with  GS  density  approximation  described  in  chapter  3 

PM=l:PiN|i.i,SJ  (4.2.12) 

»=1 

Figure  7  compares  the  nonGaussian  density  function  (4.2.10)  of  the  stochastic  FM 
input  signal  with  a  three-term  GS  density  approximation  (4.2.12)  minimizing  the 
norm  (3.2.11).  Table  4  lists  the  values  of  P,-,  6^,  and  5,  used  in  the  Gaussian 
sum  approximation  and  the  MGS  adaptive  filter. 

Table  4. 

Gaussian  sum  P,-,  6„  and  S,  values. 


It  is  assumed  that  the  nonGaussian  input  signal  u(t)  is  much  larger  than  both 
the  Gaussian  plant  noise  and  the  Gaussian  measurement  noise  Vf^.  However, 
allowing  the  power  of  u(t)  to  be  ljuger  than  the  power  (covariance  Q)  of  to*  and 
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power  (covaxiance  R)  of  is  not  enough  to  ensure  good  MGS  simulation  results. 
The  nonGaussian  input  signal  u{t)  is  modeled  by  u*  (4.2.11),  which  will  take  on 
different  6,,  S,-  pairs  at  various  times.  The  lowest  value  of  5,-  from  Table  4  is 
5j  =  0.04.  From  covariance  equation  (4.1.18)  and  Kalman  gain  (4.1.19),  repeated 
here  for  convenience 


Ml^.1  =  + rs.r^ + 

KUv  =  B )-' 

it  is  apparent  that  for  good  MGS  simulation  results,  covariances  Q  and  R  must  be 
selected  such  that  the  elements  of  are  larger  than  the  elements  of  and 

the  elements  of  DS^D^  are  larger  than  the  elements  of  R.  This  specification  is  met 
using  5,  =  0.04,  Q  =  10~^°,  R  =  10"^,  and  F,  'i’,  and  D  from  (4.2.8),  giving 


2.8  X 10-1° 

1.9  X 10-® 

1.0x10-1° 

1.0  X  10-« 

1.9  X 10-* 

1.3  X 10-° 

1.0  X 10-® 

1.0  X  10-° 

DSiD^  =  0.04  R  =  0.01 

For  comparison  purposes,  the  nonGaussian  input  signal  u{t)  of  (4.2.9)  will  also 
be  modeled  as  a  narrowband  Gaussian  signal,  generated  by  passing  white  Gaussian 
noise  through  a  linear,  time-invariant  filter.  This  shaping  filter  is  designed  using 
the  frequency  characteristics  of  u(<).  Figure  8  shows  the  power  spectral  density  of 
«(/)  and  the  power  spectral  density  of  a  second-order  shaping  filter,  modeled  by  the 
transfer  function 


GM 


N„,s 


(4.2.13) 
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25i 


Figure  8.  Stochastic  FM  signal  and  shaping  filter  power  spectral  densities. 
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and  in  continuous-time  state  space  form  as 


in6(<)  =  >ln6a^n6(<)  +  -Bn6«’n6(<)  (4.2.14) 

-  C^n6^n6(<)  (4.2.15) 

The  system  matrices  are  defined  by 

0  1  0  r  1 

C„fc=  0  1  (4.2.16) 

~°nb  ~^nb  J  L  ^  J  ^  ^ 

with  resonant  frequency  /„j,  =  2  Hz  and  damping  coefficient  Cn6  =  ^  such  that 

a„6  =  2C„6(27r/„,)  =  4  b„,  =  =  157.91  (4.2.17) 

and  w„f,  is  zero-mean  white  Gaussian  noise  with  variance  N'^i,  =  (120)^. 

The  Gaussian  input  model  of  (4.2.15)  is  now  assumed  to  model  the 

stochastic  FM  input  «(t)  of  (4.2.9)  by  augmenting  the  system  equations  (4.2.2)  and 
(4.2.3)  with  the  shaping  filter  equations  (4.2.14)  and  (4.2.15),  giving 


ia(<)  =  +  B^w„l{i)  (4.2.18) 

ya{t)  =  CMt)  (4.2.19) 


The  system  matrices  of  (4.2.18)  and  (4.2.19)  are  defined  by 


Using  a  zero-order-hold  model  with  a  sample  time  of  T  =  ^  second,  the  equivalent 


APPUCATION  TO  A  MODAL  SYSTEM 


66 


discrete-time  augmented  system  with  added  noise  terms  becomes 


Xk+i  =  -1- 

=  H^xl  +  Vk 


(4.2.21) 

(4.2.22) 


where 


-0.3271  0.0057 

-89.2411  -0.4691 
0  0 

0  0 


-0.00009  0.00008 
-0.0129  0.00524 

0.9786  0.0160 

-2.5275  0.9146 


H.=[ 


63165  100.53 


(4.2.23) 


The  first  plant  noise  term  is  the  discrete-time  version  of  the  continuous-time 
plant  noise  of  (4.2.14),  with  new  covariance  found  from  [5]  as 


J  0 


Qnb  — 


0.00002 

0.0024 

0.0047 

0.0374 

0.0024 

0.0329 

0.0564 

55.85 

0.0047 

0.0564 

1.048 

92.22 

0.0374 

55.85 

92.22 

11194 

(4.2.24) 


The  second  plant  noise  term  lo^  and  the  measurement  noise  are  the  same  as 
those  foimd  in  (4.2.6)  and  (4.2.7).  The  three  processes  wi^,  and  assumed 
to  be  mutually  statistically  independent. 
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The  Kalman  filter  based  on  the  augmented  system  (4.2.21)  and  (4.2.22),  is 
given  by 

(4.2.25) 

with  covariance  and  gain  equations 


(4.2.26) 

A'St,  =  +  R)-' 

(4.2.27) 

(4.2.28) 

Note  that  in  (4.2.25)  the  measurement  zi^  produced  from  (4.2.6)  and  (4.2.7)  using 
the  actual  nonGaussian  input  signal  (4.2.9)  is  used  instead  of  the  measurement  2% 
from  the  augmented  system  (4.2.21)  and  (4.2.22). 

The  MGS  filter  is  initialized  with  equally- valued  weighting  terms 

wi  =  for  j  =  1,2,3  (4.2.29) 

A  Markov  transition  probability  matrix  0^^,  consisting  of  elements,  is 
configured  with  a  high  probability  that  the  bias  term  does  not  switch  from  one 
value  to  another,  and  a  low  probability  that  the  bias  term  does  switch,  such  as 

.95  .025  .025 

01^5=  025  .95  .025  (4.2.30) 

.025  .025  .95 

The  initial  values  of  the  state,  state  estimate,  and  error  covari2mce  for  the 

MGS  adaptive  filter  are 
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0 

0.0004 

10000  0 

Xq  - 

0 

0.0135 

1 

o 

t— • 

o 

o 

o 

o 

(4.2.31) 


and  for  the  augmented  Kalman  filter  are 


0 

0.0004 

10000 

0 

0 

0 

0 

xsa  _ 
Xq  — 

0.0135 

Pt  = 

0 

10000 

0 

0 

0 

0 

0 

0 

10000 

0 

0 

0 

0 

0 

0 

10000 

(4.2.32) 


Figure  9a  shows  the  first  state  and  the  overall  state  estimate  xjt+i  from  the 
MGS  adaptive  filter,  while  Figure  9b  shows  the  first  state  xj^+i  and  the  overall 
state  estimate  x^+i  from  the  augmented  Kalman  filter.  Figure  10a  and  10b  show 
similar  estimation  results  for  the  second  state  xl^^.  The  MGS  adaptive  filter 
performs  very  well,  with  its  estimates  tracking  the  states  almost  exactly.  The 
augmented  Kalman  filter  however  introduces  considerable  delay  in  its  estimates. 
Table  5  shows  a  normalized  mean-square-error  percentage  measure  for  the  state 
estimates  produced  by 


EUx  -  xl^l 

mse  %  =  X 100  (4.2.33) 

The  mse  percentages  for  the  augmented  Kalman  filter  are  considerably  higher  than 
the  mse  percentages  for  the  MGS  adaptive  filter  due  to  the  delay  in  the  augmented 
Kalman  filter  estimates.  Figure  11a  shows  the  weighting  terms  for  each  of  the 
three  6^,  5,  pairs.  Figure  11b  compares  the  three  weighting  terms  with  the 
stochastic  FM  input  signal,  showing  how  the  weights  switch  according  to  the 
amplitude  of  the  input  signal. 
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Time  (/c7) 

(a) 


Time  {kl) 

(b) 


Figure  9.  (a)  State  1  and  MGS  adaptive  filter  estimate, 
(b)  state  1  and  augmented  Kalman  filter  estimate. 
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Time  (kT) 

(a) 


Time  (/c7) 

(b) 


Figure  10.  (a)  State  2  and  MGS  adaptive  filter  estimate, 

(b)  state  2  and  augmented  Kadman  filter  estimate. 
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Table  5. 


Normalized  mean-square-error  percentage  for  state  estimates. 


State 

MGS 

Augmented 

filter 

filter 

1 

1.85 

14.76 

2 

3.35 

227.67 

mse  %  =  2f^  ^  ^ 

E[x^] 
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(a) 


Time  {kT) 

(b) 


Figure  11.  (a)  Weighting  terms,  (b)  weighting  terms  compared 
with  stochastic  FM  signal. 
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4.3  Necessary  Condition  for  Effective  Estimation 


The  conditional  density  (4.1.21),  which  appeeirs  in  the  weighting  term  (4.1.27) 
is  of  prime  importance  in  determining  whether  or  not  the  MGS  adaptive  filter  will 
work  properly  for  a  given  system.  As  previously  stated,  the  estimator  (4.1.16) 
based  on  the  bias  and  covariance  pair  that  most  closely  matches  the  actual  bias 
and  covariance  pair  of  the  modeled  system  will  have  a  corresponding  weighting 
term  that  tends  closer  to  one,  while  the  weights  of  the  other  mismatched 
estimators  will  tend  towards  zero.  The  bias  and  cov«iriance  pairs  influence  the 
weighting  terms  through  the  conditional  density  (4.1.21),  rewritten  here  eis  the 
n-dimensional  (2^+1  is  an  n  x  1  vector)  Gaussian  conditional  density 


p{2k^i\KSi,Zk)  = 


(27r)»/2|ij.^,|i/2 


(4.3.1) 


where 


iZl+i  =  +  DSiD^  +  R  (4.3.2) 

is  defined  as  the  measurement  covariance  from  K£Llman  gain  (4.1.18)  and 

rl+i  =  2*+i  -  Dhi  -  H{^xi  +  r6,)  (4.3.3) 

is  defined  as  the  i‘^  measurement  residual  from  Kalman  estimator  (4.1.16). 
Essentially,  the  measurement  residual  of  the  matched  filter  (with  the  bias  and 
covariance  pair  that  most  closely  matches  the  actual  pair  of  the  modeled  system) 
will  have  the  smallest  expected  value,  typically  zero  mectn.  The  residuals  of  all  the 
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other  mismatched  filters  will  be  biased.  Under  the  Gaussian  assumption,  the 
probability  of  the  matched  filter  will  be  the  largest  among  all  the  filters. 

In  the  previous  simulation  examples  of  chapters  3  and  4,  this  method  of 
calculating  the  conditional  measurement  probabilities  (4.3.1)  and  estimator 
weighting  terms  (4.1.26)  works  very  well,  producing  very  accurate  state  estimates. 
However,  in  the  case  of  the  force  to  acceleration  modal  system  described  in  [1]  and 
by  (A. 5)  of  appendix  A,  this  method  did  not  perform  well.  Upon  analysis  of 
several  simulations,  it  appeared  that  the  statistical  steady-state  value  (that  is,  the 
steady-state  value  of  the  expected  value)  of  the  measurement  residual  of  each 
filter  converged  to  zero.  Only  the  correctly  matched  filter  is  supposed  to  have  a 
statistical  steady-state  (SSS)  value  of  zero,  while  the  residuals  of  the  mismatched 
filters  are  supposed  to  be  biased.  Yet,  in  this  case,  none  of  the  residuals  were 
biased.  Therefore,  the  MGS  algorithm  could  not  detect  which  filter  possessed  the 
correct  bias  and  covariance  pair,  and  the  weights  of  all  the  filters  converged  to  the 
same  value. 

Upon  investigation,  it  has  been  determined  that  the  SSS  value  of  the 
measurement  residual  (4.3.3)  is  not  only  a  function  of  the  bias  term  6„  but  is  also  a 
fimction  of  the  dc  gain  of  the  system  transfer  function  from  to  2* 

GWL.=G...  =  -ffU-«r'r  +  i>  (4.3.4) 

The  dc  gains  of  the  systems  used  in  the  previous  simulations  of  chapters  3  and  4 
are  nonzero,  producing  different  SSS  residual  values  due  to  the  different  bias  and 
covariance  pairs.  As  shown  in  appendix  A,  the  dc  gain  of  the  force  to  acceleration 
modal  system  is  zero,  producing  equal  SSS  residual  values  despite  differences  in  the 
bias  and  covariance  pairs. 
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In  order  to  show  the  dependency  of  the  SSS  residual  value  upon  the  system  dc 
gain,  first  rearrange  the  Kalman  filter  equation  (4.1.16)  as 

4+1  =  (/  -  KUiHmi  +  Tb,)  +  -  Dh,)  (4.3.5) 

and  take  the  expected  value 

=  (1  -  + rti) + ii:i+i(£|2.+.j  -  C6,)  (4.3.6) 

Equation  (4.3.6)  requires  the  expected  value  of  the  measurement  model  (4.1.2)  at 
iteration  fc+l,  given  as 


•^[^it+i]  —  H  -^l^ifc+i]  +  ^^k+i  +  -^bfc+i]  (4.3.7) 

with  Ui^^i  being  the  actual  input  signal,  ^lssumed  known  for  this  development. 
Substituting  the  plant  model  (4.1.1)  for  in  (4.3.7)  gives 

^[^fc+i]  =  HE[^Xk  +  r«jt  +  +  Duk+i  +  ^[ufc+i] 

=  H^E[xk]  +  HTut  +  H'^!E[wi,]  +  Duk+i  +  .^[ufc+i] 

=  H^E[xt]  +  HTu^  +  (4.3.8) 

since  and  Ufc+j  are  each  zero  mean.  Substituting  (4.3.8)  into  (4.3.6)  produces 

£|*Uil  =  (/-ifu,H)(*£|4l+r6i)+ -C6,)  (4.3.9) 

Now,  let  (4.3.9)  reach  steady-state  with  =  xj,  and  =  u^  =  u. 
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The  SSS  value  of  (4.3.9)  is  then 


f  i.  =  (/  -  +  Tfe.)  +  +  HTu  +  Du-  Dhi)  (4.3.10) 

To  find  I,,,  the  SSS  value  of  the  state  model  (4.1.1),  first  find  the  expected  value 

+  Tufc  +  '5£[u)fc] 

=  +  Tut 

and  let  ^^[xjt+i]  =  ■E'fxfc]  =  and  =  «  at  steady-state 

-I-  Tu 

Solving  for  a:,,  gives 

x..  =  (/-$)-ir«  (4.3.11) 

and  substituting  (4.3.11)  into  (4.3.10)  produces 

xi.  =  (/  -  K\,H){iSfx^„  -t-  r6,)  -H  K\,Hm  -  $)-^ru  -h  K\,{HTu  +  Du-  Db^)  (4.3.12) 
Now,  the  $(/  —  $)~^  term  of  (4.3.12)  can  be  rewritten  as  the  identity 

$(/-$)-»=(/-$)-^-/  (4.3.13) 

since  post-multiplying  both  sides  of  (4.3.13)  by  (/  —  $)  shows  the  equality 
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#(/  -  $)-^(/  -$)  =  (/-  ^)-\I  -$)-(/-$) 
$=/-/+$ 


$  =  $ 


Substituting  (4.3.13)  into  (4.3.12)  and  expanding  gives 

=  {I-  +  {I-  KsH)n,  +  K\MI  -  ^)''ru 

-  K'„HTu  +  K\,HTu  +  K\,Du  -  K\,Dhi 


xi.  =  (7  -  )$x:,  +  (7  -  K\,H)Th,  +  7C,H(7  -  ^)-^Tu 

^K\,Du-K\,Db, 


(/  -  )$*;,  +  (/  -  K>„H)Thi  -  K‘„Db,  +  -  ^j-'r  +  Bia 


[/-(/-  =  (r  -  A';,frr  -  k\.d%  +  A;.[i/(/  -  $)->r + z))u 


or 


=  1^  -  (/  -  -  a;.(ot + D)%  +  A';.1»(/  -  *)-T + dy)  (4.3.14) 

With  x„  (4.3.11)  and  ij,  (4.3.14)  known,  the  SSS  value  of  the  measurement 
residual  (4.3.3)  can  now  be  found.  T£iking  the  expected  vedue  of  (4.3.3)  and  using 
(4.3.8)  produces 

E[rUi]  =  H^E[x,]  +  HTu,  +  -  Db,  -  -  HTb,  (4.3.15) 
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Letting  (4.3.15)  reach  steady-state,  with  =  r*,  and  U;t+i  =  the  result 

is 

ri.  =  +  HTu  +  Du-  Dhi  -  -  HTb,  (4.3.16) 

Using  (4.3.11)  and  (4.3.13),  H^x,^  is  rewritten  as 

H^x,,  =  -  $)-*ru  =  H{I  -  $)-'ru  -  HTu  (4.3.17) 

Substituting  (4.3.17)  into  (4.3.16)  produces  a  cancellation  of  terms 

ri,  =  H{I  -  $)-‘ru  -  HTu  -h  HTu  +  Du-  Dh^  -  H^x\,  -  HTb^ 

=  H{I  -  $)-‘r«  +  Du  -  Dbi  -  H^xi,  -  HTbi 

=  ifr(j  -  $)-^r  -h  D]u  -  (Hr  +  D)hi  -  H$xi.  (4.3. is) 

Using  (4.3.14)  in  (4.3.18)  gives 
ri,  =  {H{I  -  $)-'r  D]u  -  (Hr  +  D)6, 

- H$[/ -{I- H;.H)$]-H[r - H;.(Hr -i- d)]6, Hi,[H(7 - $)-'r -t d]u}  (4.3.19) 

The  dc  gain  Gj^.  (4.3.4)  appears  in  two  places  in  (4.3.19)  rewritten  as 

ri.  =  G,,U  -  (Hr  +  D)6,  -  H$[/  -{I-  K\,H)^]-^[T  -  K^HT  +  D)]b,  +  Kifi.^u} 

(4.3.20) 

However,  (4.3.20)  can  be  simplified  further  using  additional  algebraic  manipulation. 
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The  HT  4-  D  term  can  be  rewritten  by  combining  the  dc  gain  (4.3.4)  eind  the 
identity  (4.3.13)  as 


G,,  =  Hmi-^)-^  +  i]r  +  D 
=  H^I-^)-^T  +  HT  +  D 

so  that 

HT-\-D  =  Gj^-  -  $)-'r  (4.3.21) 

Substituting  (4.3.21)  into  (4.3.20)  and  rearranging  terms  gives 

-  Hi{i  -(I-  7f;./r)»|-'{[r  -  -  Hi(i  -  *)-'r)|6i + k‘„g^,u} 

r;.  =  -  GjA + mi  -  4)''r6, 

-/?$[/-(/-  j(:;.7r)*]-'(ri,  -  k‘„g^m + K\.mi  -  + a';,g,,«) 

-  mi  -(I-  + if;,OT(/  -  $)- ')r6, + a;,g.,.{u  -  6,))  (4.3.22) 

Next,  the  [/  —  (/  —  K\gH)^]~^  term  can  be  rearranged  as 

[/-(/-  K\,H)^]-^  =  [(/  -  $)  +  (4.3.23) 

and  factoring  out  an  (/  —  $)  term  produces 
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[(/  -  $)  +  K'„Hfb]-^  =  [(/  +  K\,H^{I  -  $)-!)(/  -  $)]-! 


(4.3.24) 


Using  the  identity 

{AB)-^  = 

if  A  and  B  are  nonsingular  square  matrices,  setting  A  =  I  +  K\,H^{I  —  and 
B  =  {I  —  (4.3.24)  is  equal  to 

[(/  +  K\  H^I  -  #)-')(/  -  $)]'^  =  (/  -  ^)-\I  +  -  $)-')-'  (4.3.25) 

Combining  (4.3.23)  -  (4.3.25)  therefore  results  in 

[/-(/-  =  {I-  $)-'(/  +  -  $)-')-'  (4.3.26) 

Substituting  (4.3.26)  into  (4.3.22)  gives 

-  mil  -  ^)-\I  +  -  $)-»)-M(/  +  K\,H^I  -  $)->]r6,  +  KJG^J^u  -  6,)} 

and  multiplying  through  gives  a  cancellation  of  terms 

ri.  =  -  6.)  +  HfSfil  -  $)-*r6.  -  -  $)-'r6. 

-  m[I  -  -  ^)-^)-^K\,G^,iu  -  6,) 

=  G,,(«  -  6,)  -  H^I  -  $)-»(/  +  -  ^r^)-^K\fi,,iu  -  6,) 

=  [/  -  mil  -  $)-'(/ + Ki,H^I  -  $)-')-*ii:;.]Grf,(«  -  6,)  (4.3.27) 
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Substituting  (4.3.26)  into  (4.3.27),  becomes  finally 

=  (4.3.28) 

The  SSS  residual  value  (4.3.28)  is  clearly  a  function  of  the  dc  gain  ^ 
well  as  the  bias  term  6,.  It  also  is  a  function  of  the  covariance  term  5,  through  the 
steady  state  Kalman  gain  A'*,.  As  a  result,  a  nonzero  dc  system  gain  becomes  a 
necessary  condition  for  effective  estimation. 

If  Gjc  is  nonzero,  then  r*,  will  be  different  for  each  bias  and  covciriance  peiir. 
The  filter  with  the  matched  pair  that  causes  to  be  the  smallest  will  also  cause 
’“ir+i  (4.3.3)  to  have  the  smallest  expected  value  of  all  the  filters.  The  probabihty 
(4.3.1)  of  this  matched  filter  will  then  be  the  largest  among  all  the  filters,  and  will 
produce  the  largest  weighting  term  (4.1.26).  If  is  zero,  then  r*,  is  also  zero  for 
all  the  filters.  The  differences  between  the  bias  and  covariance  pairs  are  masked, 
causing  the  probabilities  (4.3.1),  and  therefore  the  weighting  terms  (4.1.26),  of  all 
the  filters  to  converge  to  the  same  value.  At  this  point,  the  MGS  algorithm 
becomes  an  ineffective  estimation  scheme.  However,  the  alternate  estimation 
procedures  described  in  the  next  section  may  provide  a  solution  to  this  problem. 


4.4  Alternate  Estimation  Procedures 

A  possible  solution  to  the  problem  of  zero  dc  system  gain  is  to  determine  if  an 
alternate  measurement  provides  a  nonzero  dc  gain.  For  example,  if  in  a  particular 
zero  dc  gain  system  acceleration  measurements  are  taken,  changing  to  velocity 
measurements  may  provide  a  nonzero  dc  gain.  If  the  actu£il  sensors  producing 
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these  measurements  cannot  be  changed,  integrating  the  acceleration  measurement 
data  to  produce  approximate-velocity  measurement  data  may  allow  the  MGS 
algorithm  to  work  effectively. 

As  shown  in  [1]  and  by  (A. 5)  of  appendix  A,  the  force  to  acceleration  modal 
system  produces  acceleration  measurement  data  at  sensor  grid  points  and  has  a 
zero  dc  gain.  This  system  is  termed  the  acceleration  modal  system,  or  AMS. 
Changing  this  AMS  to  produce  velocity  measurement  data  at  sensor  grid  p>oints 
results  in  a  new  transfer  function  with  a  nonzero  dc  gain.  This  new  system  is 
termed  the  velocity  modal  system,  or  VMS.  Typically,  changing  the  actual  system 
cannot  be  done  in  practice.  However,  given  a  large  enough  signal  to  noise  ratio 
between  the  input  signal  and  measurement  noise  a  good  approximation  of 
the  velocity  measurement  data  can  be  generated  by  integrating  the  acceleration 
measurement  data  from  the  AMS  using  a  discrete-time  integrator.  Redesigning  the 
measurement  covariance  (4.3.2)  and  residual  (4.3.3)  to  be  based  on  the  VMS 
while  actually  using  the  integrated  acceleration  {approximate-velocity) 
measurement  data  from  the  AMS  produces  proper  weighting  terms  (4.1.26)  that 
allow  the  MGS  algorithm  to  work  effectively. 

In  order  to  illustrate  this  alternate  estimation  procedure,  the  system  and  filter 
equations  for  the  VMS  aure  developed  below.  Using  a  zero-order-hold  model  for  the 
continuous-time  force  input  to  velocity  output  system  described  by  (A.6)  of 
appendix  A,  the  resulting  discrete-time  system  with  added  noise  terms  is 

«fc+i  =  -I- Tufc -I- (4.4.1) 

Vk  =  +  vji;  (4.4.2) 

Uk  =  h  +  (4.4.3) 
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where 


is  the  state  vector 


^k+i 

Vk 


^k 


^k 


Wfc 


is  the  velocity  measurement  vector 

is  a  zero  mean  white  Gaussian  plant  noise  process  with 
covariance  Q 

is  a  zero  mean  white  Gaussian  measurement  noise  process 
with  covariance  R„ 

are  the  respective  constant  transition  matrices 
is  the  vector  GS  signal  model  of  the  actual  nonGaussian 
input  signal,  comprised  of  semi-Markov  bias  vector  6^,  and 
zero  mean  white  Gaussian  noise  rii^  with  semi-Markov 
covariance 


and  the  random  quantities  iwj,,  vl,  6^,  and  are  assumed  to  be  mutually 
statistically  independent.  The  state  model  (4.4.1)  is  exactly  the  same  as  the  model 
used  previously  (4.1.1),  so  that  the  states  of  the  new  VMS  are  exactly  the  same  as 
the  states  of  the  previous  AMS.  The  Gaussian  sum  signal  model  (4.4.3)  is  also  the 
same  as  the  model  used  previously  (4.1.3).  The  measurement  model  (4.4.2)  is 
different  than  (4.1.2)  because  is  a  velocity  measurement  while  is  an 
acceleration  measurement.  Other  differences  in  (4.4.2)  include  a  new  transition 
matrix  H„  and  a  new  measurement  noise  process  vl  with  new  covariance  i2„. 

Following  the  same  procedures  in  section  4.1,  the  MGS  algorithm  is  applied  to 
the  system  of  (4.4.1)  -  (4.4.3),  producing  the  overall  state  estimate 

«t+i  =  I  n+i)  (4-4.4) 

«=i 

where  Y is  the  current  velocity  measurement  sequence  ^k'+i 
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is  given  by  the  Kalman  filter  equation 


+  +  (4.4.5) 

with  covariance  and  gain  equations 

M£'+i  =  +  r5.r  +  (4.4.6) 

^r+i  =  (4.4.7) 

=  (4.4.8) 

and  weighting  terms 

^k+i  =  P(bvSi  I  K/t+i)  =  Q+iP(yfc+i  I  hSi,Yk)Yi^ysWk^  (4.4.9) 

with 

PiVM  I  KS„Yt)  =  +  Tb,),  +  «J  (4.4.10) 

^Vs  —  p{^k+l  ~  ^if^k+l  —  I  ^k  —  ^y^k  ~  ^  j)  (4.4.11) 

wl^  =  p{b,  =  bj,S,  =  Si\Y,)  (4.4.12) 

where  C^i  is  the  scale  factor  determined  at  each  time  interval  such  that  the  sum 
of  all  the  weighting  terms  (4.4.9)  is  equal  to  one. 

The  conditional  density  (4.4.10)  of  the  VMS  is  a  function  of  the  velocity 
measurement  covariance 

=  +  (4.4.13) 

APPUCATION  TO  A  MODAL  SYSTEM  8S 


ajid  the  i**  velocity  measurement  residual 


rti  =  Vk^i  -  +  r6,)  (4.4.14) 

which  requires  velocity  measurement  data  yt+i-  This  data  is  not  avmlable,  since 
the  actual  system  being  simulated  is  the  AMS  which  produces  acceleration 
measurement  data  Zt+v  Iii  order  to  generate  the  proper  probabilities  (4.4.10) 
needed  by  the  weighting  terms  (4.4.9)  for  effective  MGS  estimation,  approximate- 
velocity  measurement  data  generated  from  the  integrcd  of  of  the  AMS,  is 
used  in  place  of  y^+i  (4.4.14)  and  (4.4.5).  This  requires  a  new  Kalman  filter 

iJVi  =  + rt, + JfSViR+i  -  HAiif + rij)|  (4.4.15) 

and  new  weighting  terms 

=  p(6.,5, 1  n+x)  =  CUiPih+i  1 6.,5,,n)f;^iW  (4.4.16) 

j=i 

requiring 

p(!/fc+i  1 6,1 5„y*)  =  +  r^),  +  iZj  (4.4.17) 

wV  =  p{h  =  h^,S^  =  S,\Y^)  (4.4.18) 

The  Kalman  gain  amd  covariance  equations  (4.4.6)  -  (4.4.8)  and  Markov 
probabilities  (4.4.11)  can  be  used  without  modification  since  they  are  not 
dependent  on  the  approwmofe- velocity  measurement  data  j/fc+i. 

The  overall  estimate  can  be  calculated  by  three  different  methods.  The  first 
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method  produces 


®fc+i  =  I  ^t+i)  (4.4.19) 

«=i 

where  xjt+i  the  state  estimate  (4.1.16)  from  the  AMS.  There  weis  never  a 
problem  in  calculating  these  AMS  state  estimates.  It  was  the  problem  in 
calculating  the  weighting  terms  that  necessitated  an  alternate  estimation  scheme. 
However,  (4.4.19)  requires  two  separate  parallel  banks  of  Kalmcin  filters:  one  using 
(4.1.16)  to  calculate  and  a  second  using  (4.4.15)  to  calculate 

wIVi  =  p(».,5iin+.). 

The  second  method  produces 

Cl  =  I  n+i)  (4-4-20) 

(=1 

where  xJJ+i  is  the  i  state  estimate  (4.4.15)  using  approitmate- velocity 
measurement  data  with  the  VMS.  This  overall  state  estimate  (4.4.20)  requires 
only  one  bank  of  parallel  Kalman  filters  using  (4.4.15)  to  calculate  both  and 
w%\i  —  p{hi,Si  I  Vfc+i).  This  provides  good  computationed  savings  over  calculating 
(4.4.19).  However,  (4.4.19)  uses  the  actual  AMS  measurement  data  z*  to  calculate 
wfiife  (4.4.20)  never  uses  z*  in  its  direct  form. 

The  third  method  uses  the  same  state  equation  (4.4.1)  and  GS  signal  model 
(4.4.3)  as  the  AMS  and  VMS,  but  requires  the  use  of  a  second  new  measurement 
formed  by  combining  z*  (4.1.2)  and  y*  (4.4.2)  into 

= +  4  (4.4.21) 
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where 


H 


(4.4.22) 


Again,  velocity  measurement  data  yi^  is  not  available,  so  approximate-velocity 
measurement  data  is  used  instead  producing 


Vk 


(4.4.23) 


The  resulting  Kalman  hlter  equation  is 


sf+i  =  iiV + r6,  +  iiriViia+,  -  + r^,)]  (4.4.24) 


with  covariance  and  gain  equations  (independent  of  y^) 


Mj^Vi  =  +  r5.r^  +  (4.4.25) 

iff+.  =  (4.4.26) 

PiV,  =  (/-«■?+, /f()MiV.  (4.4.27) 


where  the  covariance  of  is  given  by 


R  0 
0  R„ 


(4.4.28) 


and  the  resulting  weighting  terms  sae 


4Vi  =  P(6.,5. 1  Z*+,)  =  CUMlk^x  I  K,sSk)f,^Vs^V  (4.4.29) 

>=i 
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where  is  the  current  measurement  sequence  {CnC25---?Cfc}»  ^6^  is  given  by 

(4.4.11),  and 

pSm  I  hSA)  =  N|/fc($^'  +  rJi)  +  +  iJJ  (4.4.30) 

>"P  =  P(V  =  »i.St  =  5y|ZJ  (4.4.31) 

Forming  measurement  data  (4.4.23)  allows  both  the  state  estimate  (4.4.24)  and 
weighting  term  (4.4.29)  to  be  influenced  by  both  and  y^.  at  the  same  time,  while 
requiring  only  one  parallel  bank  of  Kalman  filters.  The  overall  estimate  produced 
by  using  this  third  method  is 

^ +1  =  Y.  ^+iPih  Si  1  Zfc+i)  (4.4.32) 

i=I 

These  alternate  estimation  procedures  will  be  compared  and  evaluated  in  the 
next  section. 

4.5  Comparison  of  the  Alternate  Estimation  Procedures 

The  alternate  estimation  procedures  described  in  the  previous  section  are 
compared  and  evaluated  using  a  simulation  example.  The  overall  state  estimate 
**+1  (4.4.4)  based  on  the  VMS  using  true  velocity  measurement  data  will  also  be 
calculated  as  a  benchmark  for  comparison  to  the  three  alternate  overall  state 
estimates.  For  convenience.  Table  6  provides  a  summary  of  the  overall  state 
estimates  used  in  the  comparison.  A  force  input  to  acceleration  output  modal 
system  consisting  of  three  modes  serves  as  the  simulation  example.  The  modes  are 
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calculated  using  (A. 7)  -  (A. 14)  of  appendix  A.  The  parameter  values  of  a,-,  Zc„  x, 
and  /,  used  in  the  three  mode,  six  state  model  are  given  in  Table  7. 


Table  6. 

Overall  state  estimates  to  be  compared. 


a. 

M 

sUi  =  £*»+iP(*'..'S,|yi+,) 

t=l 

Benchmark  estimate,  state  estimate 

and  weighting  term  based  on  VMS 
using  true  velocity  measurement  data. 

b. 

^1+1  = 

i=l 

i**  state  estimate  based  on  AMS. 
Weighting  term  based  on  VMS  using 
approximate-velocity  measurement  data. 

1 

^fe+1  =  1  i'fe+i) 

<sl 

state  estimate  and  weighting  term 
based  on  VMS  using  approximate-velocity 
measurement  data. 

1 2fc+i) 

i=l 

_ 

i*^  state  estimate  and  weighting  term 
based  on  combined  AMS  and  VMS  using 
acceleration  measurement  data  and 
approximate-velocity  me^lsurement  data. 

Table  7. 

Parameter  values  for  the  three  mode  model. 


i 

O'. 

Zcj  (rad) 

I  (m) 

/.  (Hz) 

1 

1.446813 

5.686 

17.5 

95.63 

2 

4.934421 

0.705 

35.0 

67.47 

3 

13.721530 

1.598 

52.5 

28.02 

APPUCATION  TO  A  MODAL  SYSTEM 


90 


The  system  transition  matrices  are  found  from  appendix  A  equations  (A. 5)  for 
the  AMS  and  (A. 6)  for  the  VMS.  Using  a  zero-order-hold  model,  a  sampling  time 
of  T  =  0.001  seconds,  and  the  values  in  Table  7,  the  discrete-time  matrices  for  the 
AMS  cure 


0.8250 

0.0009 

0 

0 

0 

0 

-339.2108 

0.8223 

0 

0 

0 

0 

0 

0 

0.9118 

0.0010 

0 

0 

0 

0 

-173.5593 

0.9022 

0 

0 

0 

0 

0 

0 

0.9846 

0.0010 

0 

0 

0 

0 

-30.6059 

0.9577 

0.0069 

0.0033 

-0.0015 

-2.7667 

-4.5899 

-4.9021 

0.0014 

-0.0040 

-0.0071 

-3.0180 

-2.5293 

-0.5295 

-0.0059 

0.0023 

0.0038 

-0.0314 

0.9420 

-0.8696 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

-2807.66 

16.1818 

-5625.67 

6.0323 

-729.65 

-11.5741 

-7693.42 

10.9286 

-5815.17 

-5.2798 

2099.60 

2.6883 

-10046.6 

2.0778 

-2530.33  - 

-13.4374 

-1278.69 

9.0026 

110.7665 

-45.3759 

-166.8950 

-45.3759 

-166.8950 

-41.4010 

-166.8950 

-41.4010 

-55.5049 

(4.5.1) 


(4.5.2) 


(4.5.3) 


(4.5.4) 


(4.5.5) 
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The  transition  matrix  needed  for  the  VMS  is 


16.2043 

0.0078 

6.3411 

0.0313 

-10.9321 

0.0234 

10.9903 

0.0213 

-4.9605 

0.0324 

0.8409 

-0.0673 

2.1583 

0.0278 

-13.2985 

0.0141 

10.1277 

0.0410 

The  measurement  model  of  the  combined  AMS  and  VMS  requires  and 
matrices  formed  using  (4.4.22).  The  plant  noise  of  the  state  model  is  zero-mean 
white  Gaussian  with  covariance 


Q  = 


1.4142  0  0 

0  0.2828  0 
0  0  0.0344 


(4.5.7) 


and  is  uncorrelated  with  AMS  measurement  noise  and  VMS  measurement  noise 
Ufe.  Both  are  zero-mean  white  Gaussian  with  respective  covariances 


26.36 

0 

0 

0.0264 

0 

0 

R  = 

0 

19.77 

0 

0 

0.0198 

0 

0 

0 

20.41 

0 

0 

0.0204 

The  measurement  noise  covariance  of  the  combined  AMS  and  VMS  is  given  by 
(4.4.28). 

The  nonGaussian  input  signal  is  the  stochastic  FM  signal  generated  by 


u(<)  =  A„  sin(27r/„<  -f  J*m(r)dr) 


(4.5.9) 
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where  the  amplitude  is  =  5,  the  carrier  frequency  is  /„  =  10  Hz,  the  modulation 
index  is  =  100,  and  m(t)  is  zero- mean  white  Gaussian  noise  with  vaxiance  1. 
Figure  12  shows  the  FM  signal  for  0.2  second.  The  probability  density  function  of 

(4.5.9)  has  the  same  form  as  (4.2.10). 

The  nonGaussian  input  signal  (4.5.9)  will  be  modeled  in  discrete-time  using 
the  Gaussian  sum  (GS)  signal  model  described  in  section  4.1 


Wfc  —  ^ifc  + 


(4.5.10) 


with  GS  density  approximation  (4.2.12).  For  simplicity,  the  nonGaussian  input 
signed  is  applied  only  to  mode  1. 

Three  filters  are  used  in  each  MGS  adaptive  filter  of  Table  6.  Using  the  vedues 
of  Table  4,  the  and  5,  parameters  for  the  three  filters  are 


-5 

5 

1 

o 

1 _ 

II 

0 

0 

II 

1 

o  o 

_ 1 

II 

0 

0 

(4.5.11) 
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S3  — 

0 

4.28 
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0 

0.04 

0 

0 

4.28 

(4.5.12) 


Three  of  the  MGS  adaptive  filters  of  Table  6  require  approximate-velocity 
measurements.  A  discrete-time  integrator  implementing  the  trapezoidcd  rule  is 
used,  given  by  [59] 


Vk+i  =  y*  +  +  ^k) 


(4.5.13) 
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Figure  12.  Stochastic  FM  input  signal. 
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Figure  13a  shows  the  mode  1  acceleration  measurement  2^,  while  Figure  13b  shows 
both  the  mode  1  actual  velocity  measurement  and  opyroiimate-velocity 
measurement  y^.  The  two  velocity  measurements  track  fairly  closely  under  low 
noise  conditions. 

Each  MGS  adaptive  filter  is  initialized  with  equally-valued  weighting  terms 

=  i  for  j  =  1,2,3  (4.5.14) 

A  Markov  transition  probability  matrix  0^^,  consisting  of  O'l/g  elements,  is 

configured  with  a  high  probabihty  that  the  bias  term  does  not  switch  from  one 
value  to  another,  and  a  low  probability  that  the  bias  term  does  switch,  such  as 

.85  .075  .075 

0t^s=  .075  .85  .075  (4.5.15) 

.075  .075  .85 


The  initial  values  of  the  state,  state  estimate,  and  error  covariance  for  each  MGS 
adaptive  filter  are 


10000  0  0  0  0  0 
0  10000  0000 
0  0  10000  0  0  0 
0  0  0  10000  0  0 
0000  10000  0 
0  0  0  0  0  10000 


(4.5.16) 
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(a) 


Time  {kT) 

(b) 


Figure  13.  (a)  Mode  1  acceleration  measurement  (b)  mode  1  actual  velocity 
measurement  y*  and  approximate-velocity  measurement  jffc. 
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Each  plot  in  Figure  14  shows  the  position  state  from  mode  1  and  an  overall 
position  state  estimate  from  a  particular  MGS  adaptive  filter.  Figures  14a,  b,  c, 
and  d  follow  the  designations  of  Table  6:  estimator  a,  iJJ+i;  estimator  6,  ij+i; 
estimator  c,  estimator  d,  ij+j.  Each  plot  in  Figure  15  shows  the  velocity 

state  from  mode  1  and  an  overall  velocity  state  estimate  from  a  particular  MGS 
adaptive  filter.  The  states  and  state  estimates  are  shown  for  mode  2  in  Figures  16 
and  17,  and  for  0.4  seconds  of  mode  3  in  Figures  18  and  19.  Apart  from  benchmark 
estimator  a,  the  best  estimates  appear  to  come  from  estimators  d  and  c,  while 
estimator  b  deviates  the  most  from  the  actual  state. 

Figure  20a  shows  the  slightly  noisy  weighting  terms  used  in  estimator  a. 
Figure  20b  shows  the  weighting  terms  used  in  estimators  b  and  c,  which  are 
smoother  due  to  the  effect  of  integrating  the  acceleration  measurements.  Figure 
20c  shows  the  weighting  terms  used  in  estimator  d,  which  are  also  smoother  thzin 
those  of  estimator  a,  and  reach  and  remaun  at  their  peak  values  more  so  than  those 
of  estimators  6  and  c. 

Table  8  shows  a  normalized  mean-square-error  percentage  measure  for  the 
state  estimators,  calculated  using  (4.2.33).  The  mse  percentages  are  the  lowest  for 
benchmark  estimator  a,  followed  by  estimators  d,  c,  and  a. 

Figure  21a  shows  the  three  mode  1  residuals  (4.2.3)  produred  from  the  MGS 
adaptive  filter  using  the  zero  dc  gain  force  to  acceleration  modal  system  (A. 5)  of 
appendix  A.  The  three  residuals  become  equal  once  steady-state  is  reached. 
Figure  21b  shows  the  three  mode  1  residuals  produced  from  the  MGS  adaptive 
filter  using  the  nonzero  dc  gain  force  to  velocity  modal  system  (A. 6)  of  appendix  A. 
The  residuals  remain  biased  throughout  the  simulation. 
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Figure  14.  Mode  1  position:  (a)  State  and  estimate  xjj+i,  (b)  state  and  estimate  Xj+i, 
(c)  state  and  estimate  (d)  state  and  estimate  xj[.+i. 
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Figure  15.  Mode  1  velocity:  (a)  State  and  estimate  (b)  state  and  estimate  x|.  j, 
(c)  state  and  estimate  xj^+n  (d)  state  and  estimate  xjt+i* 
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—  Mode  2  Position 
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0  0.05  0.1  0.15  0.2 
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Figure  16.  Mode  2  position:  (a)  State  and  estimate  ifc+i,  (b)  state  and  estimate  x|+i, 
(c)  state  and  estimate  (d)  state  and  estimate 
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Figure  17,  Mode  2  velocity:  (a)  State  and  estimate  ijt+j,  (b)  state  and  estimate 
(c)  state  and  estimate  (d)  state  and  estimate  xj^+j. 
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Time  {kV) 

(b) 


Time  {kT) 
(c) 


Time  {kT) 
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Figure  18.  Mode  3  position:  (a)  State  and  estimate  xJJ+i,  (b)  state  and  estimate  xj+i, 
(c)  state  2Uid  estimate  (d)  state  and  estimate 
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Figure  19.  Mode  3  velocity:  (a)  State  and  estimate  Xfc+i,  (b)  state  and  estimate 
(c)  state  and  estimate  (d)  state  and  estimate 
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Figure  20. 


(a)  Weighting  terms,  estimator  a,  (b)  weighting  terms,  estimators  b  and  c, 
(c)  weighting  terms,  estimator  d. 
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Table  8. 


Normalized  mean-square-error  percentage  for  state  estimates. 


State 

Estimator  a 

il+i 

Estimator  b 
^1+1 

Estimator  c 
xl+i 

Estimator  d 

Mode  1  position 

1.60 

3.62 

17.04 

11.71 

Mode  1  velocity 

0.87 

26.70 

6.93 

5.93 

Mode  2  position 

1.02 

33.83 

15.32 

12.20 

Mode  2  velocity 

1.74 

11.21 

6.09 

4.18 

Mode  3  position 

0.94 

24.73 

16.70 

13.06 

Mode  3  velocity 

0.39 

39.03 

0.34 

0.38 

mse  %  = 


E[x^] 


xlOO 
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Time  (/c7) 
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Figure  21.  (a)  Residuals  using  the  zero  dc  gain  force  to  acceleration  modal  system, 
(b)  residuals  using  the  nonzero  dc  gain  force  to  velocity  modal  system. 
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5.0  PARAMETER  UPDATING  METHODS 


As  described  in  chapter  4,  the  input  to  the  modal  system  is  unknown,  but  its 
frequency  characteristics  are  known  and  the  input  is  always  present.  It  is  also  not 
directly  measurable  by  the  sensors.  The  nonGaussian  nature  of  the  stochastic  FM 
input  signal  is  modeled  using  the  6j  and  5,-  parameters  of  the  Gaussian  smn  signal 
model.  These  parameters  are  closely  related  to  the  amplitude  of  this  signal,  due  to 
the  nature  of  its  probability  density  function  (4.2.10).  Two  of  the  6,  bias  terms  are 
equal  to  the  positive  and  negative  peak  amplitudes  of  the  FM  signal,  with  small 
corresponding  variances.  The  5j  term  associated  with  the  6,  =  0  bias  term  has  a 
value  of  roughly  ,  where  is  the  peak  value  of  the  input. 

The  stochastic  FM  signals  used  in  the  simulation  examples  in  chapter  4  had 
constant  amplitudes.  In  practice,  the  amplitude  may  shift  over  a  period  of  time. 
For  good  performance  under  these  conditions,  the  MGS  adaptive  filter  must  have 
some  approximate  knowledge  of  the  amplitude  of  the  FM  signal  in  order  to  update 
the  bi  and  5,  parameters.  Two  methods  of  updating  the  parameters  of  the  MGS 
adaptive  filter  are  described  in  this  chapter.  One  method  involves  processing  the 
measurement  residual  of  the  filter  incorporating  the  6,  =  0  (zero-bias)  term.  The 
second  method  involves  the  use  of  a  Gaussian  double-sum  to  detect  a  shift  in 
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amplitude.  A  simulation  example  is  used  to  examine  the  performance  of  these  two 
methods. 


5.1  Zero-Bias  Measurement  Residual  Method 

One  method  of  updating  the  and  S,  pcirameters  of  the  MGS  adaptive  filter 
involves  processing  the  measurement  residual  of  the  filter  in  the  parallel  bank  of 
Kalman  filters  incorporating  the  6^  =  0  (zero-bias)  term.  Using  the  modal  system 
model  of  chapter  4  and  the  MGS  adaptive  filter  equations  of  Table  3,  the 
measurement  residual  is 


=  ^k+i  ~  Dhi  -  if ($4  -I-  Tbi)  (5.1.1) 

For  notational  clarity,  set  t  =  0  and  6o  =  0  (5.1.1)  to  produce  the  zero-bias 

measurement  residual 


=  (5.1.2) 

Substituting  measurement  equation  (4.1.2)  at  time  iteration  k+1  into  (5.1.2)  gives 

^fc+i  =  Hxk+i  -I-  Duk+i  +  (5.1.3) 

and  substituting  plant  equation  (4.1.1)  into  (5.1.3)  gives 


^fc+i  =  +  Fujt  -f  <lfwt)  -I-  I>Ufc+i  -I-  Vk+i  -  H^xl  (5.1.4) 
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Rearranging  (5.1.4)  produces 


rk+i  =  H^Xk-^k)  +  HTuk  +  Duk+i  +  H'<i!Wk  +  Vk+i  (5.1.5) 

Now,  assuming  that  the  input  signal  is  much  larger  than  both  and  such 
that  these  two  terms  can  be  neglected,  assuming  that  the  input  signal  changes 
slowly  enough  such  that  u^.  is  approximately  equal  to  and  setting  the  error  of 
the  zero-bias  filter  to  be  =  (r/t  —  r?),  of  (5.1.5)  becomes  approximately  equal 
to 

rU,c,H^eU{Hr  +  D)u,  (5.1.6) 

Premultiplying  both  sides  of  (5.1.6)  by  the  inverse  of  {HT  -f  D)  if  {HT  +  D)  is 
square,  or  by  its  pseudo-inverse  if  non-square,  and  taking  the  expected  value,  the 
zero- bias  measurement  residucd  is  processed  as 

ajt,  =  (HT  +  ^  (HT  +  C)->OT£1eS)  +  .El«il  (5.1.7) 

Assuming  that  the  expected  value  of  the  error  is  E[e^  =  0,  and  given  that 
processed  zero-bias  meastirement  residual  is  approximately  equal  to 

the  input  signal 

«ifc+i  ^  (5.1-8) 

Information  contained  in  is  used  to  periodically  update  the  6,  and  5, 
parameters  of  the  MGS  adaptive  filter.  The  zero-bias  residual  (5.1.2)  is  computed 
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and  stored  at  each  iteration.  After  n  iterations,  the  residual  sequence  is  processed 
using  (5.1.7),  producing  (5.1.8).  The  positive  and  negative  peak  values  of  (5.1.8) 
are  found  from  the  sequence,  and  the  power  of  the  sequence  is  calculated.  A 
combination  of  these  values  is  used  to  update  the  6,  and  5,  parameters  in  order  to 
reduce  errors  related  to  the  assumptions  used  in  the  development  of  (5.1.8).  Due 
to  the  sinusoidal  nature  of  the  input  signal,  the  (negative-bias)  and  63  (positive- 
bias)  parameters  can  be  updated  using 

bi  =  0P„-{l-l3)^P~  (5.1.9) 

62  =  /3^p  +  (l-/3)^f2P~  (5.1.10) 


where  P„  is  the  negative  peak  value  of  the  sequence,  P^  is  the  positive  peak  value 
of  the  sequence,  is  the  power  of  the  sequence,  and  ^  is  a  weighting  factor 
between  zero  and  one.  Once  62  (5.1.10)  is  updated,  the  53  (variance  of  the  zero- 
bias  filter)  parameter  is  updated  using 


53  =  (0.462)*  (5.1.11) 

After  all  the  parameters  are  updated,  the  sequence  is  cleared  and  a  new  sequence  is 
initiated  for  the  next  n-sample  update. 

A  simulation  example  illustrating  the  use  of  the  zero- bias  measurement 
residual  method  of  updating  the  6j  and  S,  parameters  is  now  given.  A  modified 
first-order  discrete-time  system  from  [60]  is  used,  given  by 

Xk+i  =  ^Xt-\-Tuk  +  ^Wk  (5.1.12) 

+ -Dufc  +  Ufc  (5.1.13) 
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with  values 


$  =  0.7  r  =  1.0  $  =  1.0  Jf  =  0.5  £>  =  2.0  (5.1.14) 

The  plant  noise  is  zero-mean  white  Gaussian  with  covariance  Q  =  0.05,  and  is 

uncorrelated  with  measurement  noise  Vi^,  which  is  also  zero-metin  white  Gaussian 
with  covariance  R  =  0.05. 

The  nonGaussian  stochastic  FM  input  signal  (4.2.9)  in  this  example  has  an 
amplitude  of  A^  =  5  for  the  first  60  iterations  and  an  amplitude  of  A^  =  2  for  the 
remaining  140  iterations.  The  input  is  modeled  using  the  Gaussian  sum  signal 
model  described  in  section  4.1 

Ufc  =  6fc  +  n*  (5.1.15) 

Three  filters  of  the  form  found  in  Table  3  axe  used  in  the  simulation.  The  initial  6, 
and  5,  parameters  for  the  three  filters  are 

6j  =  -5  62  =  5  63  =  0  (5.1.16) 

5,  =0.04  52  =  0.04  53  =  4.28  (5.1.17) 

The  MGS  adaptive  filter  is  initialized  with  equally- valued  weighting  terms 

«;g  =  ^,  fori  =1,2,3  (5.1.18) 

A  Markov  transition  probability  matrix  0J'^,  consisting  of  9\^s  elements,  is 
configured  with  a  high  probability  that  the  bias  term  does  not  switch  from  one 
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value  to  another,  and  a  low  probability  that  the  bias  term  does  switch,  such  as 


.90 

.05 

.05 

.05 

.90 

.05 

.05 

.05 

.90 

(5.1.19) 


The  initial  values  of  the  state,  state  estimate,  and  error  covariance  for  the  MGS 
adaptive  filter  are 


a;„  =  0  Xo  =  16  P„  =  10000  (5.1.20) 

Figure  22  compares  the  stochastic  FM  signal  with  the  processed  zero-bias 
measurement  residual  (5.1.8).  The  two  are  quite  similar  in  appearance.  Figure  23a 
shows  the  state  and  the  state  estimate  without  using  the  parameter 
updating  method.  The  initial  estimates  are  good  since  the  parameters  are  properly 
matched  to  the  input.  However,  after  the  input  switches  amplitude  from  =  5  to 
=  2,  the  estimates  degrade  since  the  parameters  are  now  mismatched.  Figure 
24a  shows  the  state  and  the  state  estimate  using  the  parameter  updating 
method  every  30  iterations.  The  initial  estimates  track  well.  Then  the  estimator 
goes  through  a  period  of  learning  a  new  set  of  parameter  values  as  the  amplitude  of 
the  input  changes.  Once  the  correct  parameters  are  found,  the  estimator  tracks 
with  minimal  error.  Figure  24a  shows  the  weighting  terms  of  the  MGS  adaptive 
filter  with  no  parameter  updating,  and  Figure  24b  shows  the  weighting  terms  of  the 
MGS  adaptive  filter  with  parameter  updating.  Note  how  only  the  MGS  filter  with 
parameter  updating  develops  proper  weighting  terms  for  the  estimates  during  the 
time  the  signal  has  an  amplitude  of  =  2. 
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Figure  22.  Stochastic  FM  signal  and  processed  zero-bias  measurement  residual. 
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(a) 


Iteration  (/c) 

(b) 


Figure  23.  (a)  State  and  state  estimate  with  no  parameter  updating,  (b)  state  and 
state  estimate  using  zero-bias  measurement  residuaJ  parameter  updating  method. 
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Figure  24.  (a)  Weighting  terms  with  no  parameter  updating,  (b)  weighting  terms 
using  zero-bias  measurement  residual  parameter  updating  method. 


PARAMETER  UPDATING  METHODS 


115 


5.2  Gaussian  Double-Sum  Method 


A  second  method  of  updating  the  parameters  of  the  MGS  adaptive  filter 
involves  the  use  of  a  Gaussian  double-sum  to  detect  a  shift  in  amplitude.  The 
nonGaussian  signal  model  (5.1.12)  has  a  Gaussian  sum  probability  density  fimction 
of  the  form 


M 

p(u)='£PMhSi]  (5.2.1) 

i=l 

Now,  a  new  nonGaussian  input  signal  model  is  generated  by  multiplying  the 
original  model  by  a  constant  gain,  producing 

s  =  Au  =  i4{fe -f  n)  (5.2.2) 

The  gain  A  is  a  semi-Markov  process  with  values  randomly  selected  from  a  fixed 
set  of  discrete  values,  characterized  by  a  delta  probability  density  function 

p(A)=f;P/5(A-A^)  (5.2.3) 

j=t 


with 


=  P,^>0  for>  =  l,2 . N  (5.2.4) 

i=i 

This  process  can  be  thought  of  as  a  randomly-switching  gain,  each  gain  value  Aj 
having  probability  Pf.  It  is  assumed  that  A  and  u  are  statistically  independent 
random  processes. 
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The  probability  density  function  of  the  new  nonGaussian  input  signal  model 
(5.2.2)  is  found  using  a  general  result  from  [61] 


pW  =  I  i^i  pMA)pM)iA  (5.2.5) 

substituting  (5.2.3) -into  (5.2.5)  gives 

P(»)  =  -  A,)dA  (5.2.6) 

Interchanging  integration  and  summation 

P('')  =  E  [|r„  (ii  (5.2.7) 

Using  the  sifting  property  of  the  delta  function  [54] 

/(a)=  r  f{x)6{x-a)dx  (5.2.8) 

J  — OO 

the  integral  of  (5.2.7)  is  evaluated  as 

pW  =  E^’y[ri-p.(‘'M>)l  (5-2-9) 

i=i 


The  Gaussian  density  N[5i,5i]  of  (5.2.1)  h2is  the  form 
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Solving  (5.2.2)  for  u  =  sjAj  and  substituting  into  (5.2.10)  produces 


(5.1.11) 


and  rearranging  the  exponent  gives 


Nft,S,)  = 


1 

2 

e  ■- 


(s-Ajbif 


(5.2.12) 


Combining  (5.2.12)  and  (5.2.1)  and  substituting  into  (5.2.9)  produces 


1 


i| 

■5 


{s-Ajb-)^ 

A]S, 


(5.2.13) 


or  in  abbreviated  notation 


M  N 

pW  =  E  E  PiP}  AJSJ  (5.2.14) 

i=i  j=i 

Thus,  the  nonGaussian  density  function  of  s  can  be  modeled  cis  the  Gaussian 
double-sum  (5.2.14).  The  Aj  parameter  is  used  to  account  for  any  shift  in  the 
amplitude  of  the  actual  nonGaussian  input  signal.  Several  values  of  Aj  are  selected 
to  cover  a  range  of  possible  input  amplitudes.  If  the  actual  amplitude  changes,  the 
MGS  algorithm  will  react  by  selecting  the  filters  using  the  most  properly  matched 
Aj  value.  However,  a  total  of  MN  filters  is  now  required  to  implement  the  MGS 
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algorithm,  rather  than  only  M  filters  previously  required.  Following  a  similar 

development  given  in  chapter  3,  the  modified  Gaussian  double-sum  (MGDS) 

adaptive  filter  equations  for  a  modal  system  are  summarized  in  Table  9.  A  block 
diagram  of  the  MGDS  adaptive  filter  is  shown  in  Figure  25. 

The  simulation  example  of  section  5.1  will  be  used  again,  this  time  illustrating 
the  use  of  the  Gaussian  double-sum  method  of  updating  the  MGS  adaptive  filter 
parameters.  Three  6,  and  5,  parameter  values  are  used  again 

=  62  =  1  63  =  0  (5.2.15) 

5i  =  0.04  52  =  0.04  53  =  0.16  (5.2.16) 

In  effect,  these  parameters  remain  fixed.  Choosing  the  proper  Aj  parameter  causes 
the  MGDS  adaptive  filter  values  to  change.  The  53  parameter  is  set  to  (§)*  =  0.16 
to  allow  for  the  correct  scaling  when  multiplied  by  The  5i  and  52  values  are 
not  scaled  by  Aj  since  they  model  the  steep  peaks  at  either  end  of  the  stochastic 
FM  probability  density  function  (4.2.10).  These  two  parameters  properly  model 
these  peaks  for  all  amplitude  values. 

Two  Aj  values  are  used  in  the  MGDS  adaptive  filter  to  cover  the  range  of  the 
input  signal  amplitudes  used  in  the  simulation.  They  2ire 

Ai=2  A2  =  5  (5.2.17) 

The  MGDS  adaptive  filter  is  initialized  with  equally-valued  weighting  terms 

=  g,  for  i  =  1,2,3  and  j  =  1,2  (5.2.18) 
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Table  9. 


Modified  Gaussian  double-sum  adaptive  filter  equations  for  a  modal  system. 


System:  Xjt+i  =  +  Fsjt  -f- 

Zif  =  Hxig  +  Ds/^  -|-  Vie 

=  ^ki^k  +  ”*) 

M  N 

Overall  estimate:  =  j  I  ^k+l) 

i=l  j=l 


Kalman  filter  equation: 

4-Vi  =  -  DAjbi  -  +  r>i,6,)] 


Kalman  gain  equations:  M)/^i  =  4-  -f 

+  DA)SiD-^  +  R)-^ 


Weighting  term: 

“I'+i  =  P{hSi,Ai  I  Zi+.)  =  CU,Ah»  I  hSi,A^,Zk)'E 

a=l  0=l1 

with 

p(^i+i  I  i>i,Si,Ai,Zi,)  =  Ni/r(«ii' + + DA,K  + da%d^ + ij] 

^bS  —  P(^k+1  —  ^i7^k+l  —  I  ^k  —  ^ai^k  =  ^a) 

I  >1*  =  A0) 

=  P{h  =  K^Sk  =  Sa,  Ak  =  A0\Zk) 


M  N 

and  scale  factor  Cl^.i  such  that 

i=i  >=i 
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Two  Markov  transition  probability  matrices,  ©^5  and  0^,  are  used  in  the  MGS 
adaptive  filter,  given  by 


.90 

.05 

.05 

r 

CitQ  _ 

^65  — 

.05 

.90 

.05 

.05 

.05 

.90 

- 

.95 

.05 


.05 

.95 


(5.2.19) 


Figure  26a  shows  the  state  i/t+i  same  state  estimate  Xjt+i  generated 

previously  without  parameter  updating.  Figure  26b  shows  the  state  and  the 
state  estimate  it+i  using  the  Gaussian  double-sum  parameter  updating  method. 
The  MGDS  estimates  track  well  throughout  the  simulation,  without  any  noticeable 
learning  time. 

Figure  27a  shows  weighting  terms  1-3  of  the  MGDS  adaptive  filter  with 
Aj  =  2,  and  Figure  27b  shows  weighting  terms  4-6  with  Aj  =  5.  The  input  signal 
starts  out  with  an  amplitude  of  A^,  =  5,  so  weighting  terms  4-6  initially  are  in 
effect.  After  the  aunplitude  switches  to  Au  =  2,  weighting  terms  1-3  take  over,  with 
one  exception.  Weight  3  (for  6,  =  0)  should  be  larger  than  weight  6  (also  for  5,  =  0) 
since  Aj  =  2  currently  matches  the  actual  input  amplitude.  However,  the 
measurement  residuals  for  corresponding  filters  3  and  6  are  equal,  because  their  6, 
terms  are  each  zero.  This  causes  the  difference  in  their  5,  variance  terms  to  be  the 
deciding  factor  in  calculating  the  weights.  The  Gaussian  density  function  in  Table 
9  used  to  calculate  the  weights  has  a  larger  variance  for  weight  6  {Aj  =  5)  than  for 
weight  3  {Aj  =  2).  Therefore,  the  probability  calculated  for  weight  6  will  be  larger 
than  the  probability  calculated  for  weight  3,  so  that  weight  6  governs  the  overall 
state  estimate  more  so  than  weight  3. 
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Figure  26.  (a)  State  and  state  estimate  with  no  parameter  updating,  (b)  state  and 
state  estimate  using  Gaussian  double-sum  parameter  updating  method. 
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(a) 


Iteration  (/c) 

(b) 


Figure  27.  (a)  Weighting  terms  1-3  with  Aj  =  2, 
(b)  weighting  terms  4-6  with  Aj  =  5. 
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6.0  CONCLUSIONS 


This  investigation  is  concerned  with  effective  state  estimation  of  a  system 
driven  by  an  unknown  nonGaussian  input  with  additive  white  Gaussian  noise,  and 
observed  by  measurements  containing  feedthrough  of  the  same  nonGaussian  input 
and  corrupted  by  additional  white  Gaussi<ui  noise.  A  Gaussian  sum  (GS)  approach 
has  previously  been  developed  [6-8]  which  can  cope  with  the  nonGaussian  nature  of 
the  input  signal.  Due  to  a  serious  growing  memory  problem  in  this  approach,  a 
modified  Gaussian  sum  (MGS)  estimation  technique  is  developed  that  avoids  the 
growing  memory  problem  while  providing  effective  state  estimation.  Several 
differences  between  the  MGS  and  GS  algorithms  are  examined,  showing  the  MGS 
algorithm  to  be  a  better  performer. 

An  MGS  adaptive  filter  is  derived  for  a  general  system  and  a  modal  system, 
with  simulation  examples  performed  using  a  nonGaussian  input  signal.  The  modal 
system  simulation  results  are  compared  to  those  produced  from  an  augmented 
Kalman  filter  based  on  an  augmented  modal  system  model  assuming  a  narrowband 
Gaussian  input  signal.  The  results  show  that  the  MGS  adaptive  filter  provides 
better  estimates  than  the  augmented  Kalman  filter. 

A  necessary  condition  for  effective  MGS  estimation  is  derived,  namely  that  the 
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system  must  have  a  nonzero  dc  gain.  Three  alternate  estimation  procedures  are 
developed  to  compensate  for  situations  when  this  condition  is  not  satisfied.  The 
alternate  MGS  adaptive  filters  are  simulated  and  their  performance  results  are 
analyzed  and  compared. 

Two  methods  of  monitoring  and  updating  key  parameters  of  the  MGS  filter 
are  developed.  One  is  the  zero-bias  measurement  residual  method  and  the  second 
is  the  Gaussian  double-sum  method.  Simulation  results  are  analyzed  to  investigate 
the  performance  of  these  methods,  with  the  Gaussian  double-sum  method  proving 
to  be  a  better  performer  at  the  expense  of  increased  computational  burden. 

Several  contributions  to  the  field  of  applied  estimation  theory  are  made  from 
this  investigation.  These  include: 

1.  The  development  of  a  modified  Gaussian  sum  algorithm  with  nongrowing 
memory  based  on  a  nonGaussian  signal  model  with  a  Gaussian  sum 
probability  density  function.  Parameters  from  this  model  are  used  directly  in 
the  modified  Gaussian  sum  adaptive  filter  structure. 

2.  A  comparison  between  the  GS  filter  of  [6]  and  the  MGS  adaptive  filter.  The 
two  are  similar,  but  the  comparison  shows  the  MGS  adaptive  filter  to  be  a 
good  improvement  to  the  GS  filter. 

3.  An  exeunination  of  a  necessary  condition  for  effective  MGS  estimation.  This 
condition  provides  a  simple  test  to  determine  if  the  MGS  adaptive  filter  will 
work  properly  for  a  given  system. 

4.  An  alternate  configuration  of  the  MGS  adaptive  filter  when  the  necessary 
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condition  of  3  above  is  not  met.  This  configuration  is  applied  in  several  ways, 
and  each  is  evaluated  on  a  performance  basis. 

5.  Two  methods  of  monitoring  and  updating  key  parameters  of  the  MGS 
adaptive  filter.  These  allow  the  estimator  to  react  to  changes  in  the  input 
signal  level  which  cause  the  signal  to  be  nonstationaxy  over  long  i)eriods  of 
time. 

Some  suggested  directions  for  future  development  include  a  closer  examination 
of  the  assumptions  used  in  the  MGS  algorithm  to  alleviate  the  increasing 
computational  and  storage  requirements  of  the  growing  memory  GS  algorithm, 
applying  the  MGS  algorithm  to  a  larger  variety  of  nonGaussian  input  signals,  and 
examining  how  to  implement  the  algorithm  in  the  case  of  nonlinear  systems  with 
nonGaussian  inputs. 
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APPENDIX  A 


The  transfer  matrix  from  actuator 
for  the  modal  model  described  in  [1]  is 


force  inputs  to  sensor  acceleration  outputs 


m 

G(s)  =  sJ; 


•-1L 


s-Xi  s-A- 


(A.l) 


where 

m 

is  the  total  number  of  modeled  (complex)  modes 

A. 

is  the  1*^  eigenvalue  of  the  transfer  matrix 

<t>i 

is  the  mode  shape  vector  (eigenvector) 

Ci 

is  the  modal  participation  factor 

An  equivalent  continuous*  time  state  representation  of  (A.l)  has  the  form 


i(<)  =  Ax{t)  d-  Bu{t)  (A.2) 

y(<)  =  Cx{t)  +  Duit)  (A.3) 


where 
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(A.4) 


®(<)=[9l(0  9l(<)  •••  9m(<)  9m(0] 

are  the  modal  positions  and  velocities,  u{t)  is  the  m-length  vector  of  actuator  input 
forces,  and  y{i)  is  the  m-length  vector  of  accelerations  at  sensor  grid  points. 

Clearly,  the  dc  gain  of  (A.l)  is  zero  since  for  s  =  0,  G(s)  =  0.  Differentiating 
(A.l)  once  with  respect  to  s  removes  the  s  in  front  of  the  summation,  producing  a 

new  transfer  matrix  from  actuator  force  inputs  to  sensor  velocity  outputs.  This 

new  transfer  matrix  has  a  nonzero  dc  gain.  The  discrete- time  equations  (4.1.1)  and 
(4.1.2)  are  generated  using  a  zero-order-hold  model.  Therefore,  the  discrete-time  dc 
gain  (4.2.4)  from  Ujt  to  z^.  is  equal  to  the  continuous-time  dc  gain. 

The  A,  B,  C,  and  D  matrices  of  the  state  space  representation  of  the  force  to 
acceleration  transfer  matrix  are  computed  as  follows.  Define 

A  =  diag{Aj,  Ap,, 

C  =  diag{cj,  Cj, . . .,  c„„ 

^  =  [^1  <f>m  '^m] 

Then 

A  =  A,  B  =  C  =  D  =  (A.5) 

The  A,  B,  C,  and  D  matrices  of  the  state  space  representation  of  the  force  to 
velocity  transfer  matrix  are 

A  =  A,  B  =  Ac4^  C  =  ^PC^A-^  D  =  0  (A.6) 
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The  A  and  B  matrices  of  (A.6)  are  the  same  as  those  from  (A.5).  This  allows  the 
states  (A.4)  to  remain  unchanged  when  either  acceleration  or  velocity  outputs  are 
used. 

Numerical  values  for  the  parameters  in  (A.5)  and  (A.6)  are  generated  using 
the  algorithm  below.  In  these  equations  the  subscript  i  represents  the 
corresponding  mode  number,  and  values  are  generated  for  i  =  l,2,...,m.  The 
eigenvalues  are  given  by 


A.  =  -a,  ±  (A.  7) 

where  the  are  randomly  selected  with  uniform  distributions  over  three  intervals: 
[1,2],  [2,5],  and  [5,15],  generally  with  the  highest  frequency  of  occurrence  in  [1,2], 
followed  by  [5, 15],  and  then  by  [2,5].  The  w,  are  given  by 

=  2irfi  (A.8) 

/.  =  100e‘®-^^“'"^^  (A.9) 


The  eigenfimctions  are  given  by 


^.(x)  =  ^  (A.10) 

where  x  is  the  m  x  1  vector  of  grid  point  locations  distributed  over  a  length  of  70 
meters,  and 


=  ^  (A.ll) 

Vi  =  3000  e"””'  (A.12) 
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The  magnitudes  of  the  modal  participation  factors  are  given  by 


(A.13) 


and  the  phcise  angles  are 

Zc,-  =  random  values,  uniformly  distributed  on  the  interval  [0,27r]  (A. 14) 

Using  the  algorithm  above,  the  parameters  of  (A. 5)  and  (A. 6)  are  completely 
specified  by  ctj,  i,  and  Zc,-  for  i  =  l,2,...,m  and  some  positive  integer  m. 
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Key  \\*ords 

Nonliiirar  pstiuiatioii.  iionliupftr  sf\in|)lp*l-<lflfa  simultil loii.  ionally  ('lii- 

cient  psiimalioii.  slatp  pstimalioti. 

Abstract 

An  psl  iiiial  ion  lp(.  lini((MP  is  {jipsentp<l  lor  llip  (  lass  ol  iioiilinpar  systems  (onsisl  inir 
ol  iiiemoryless  nonliiiparitips  eiiibedderl  in  a  <lyiiaiiiic  linear  syslein.  1  Up  a|j|)roac  li  is 
baserl  on  a  nspf  '1  saniplp'.l-data  non-linear  system  simnlalion  melliocl.  wliicli  iinobes 
I  Up  addition  ol  an  pxlra  slate  xariaUle  lor  earU  noidiin^ar  element.  1  Up  nonlinear 
estimator  is  developerl  along  tlie  lines  (jI  tUe  Uasie  Kalman  stale  eslimalion.  using 
((nasi-liiiearization  instead  ol  tUe  lav  lor  series  linearization  nsp(l  in  E.xtended  Kalman 
Filters.  It  is  demonstrate*.!  tUat  tliis  ne'v  metUo*!.  out  pertorms  tlie  extenderl  Kalman 
filter  in  terms  of  tlie  mean-srpiare  error  ol  lit*'  staf(»  estimate.  1  Uis  estimator  ran 
Up  n.serl  effrctivelv  for  state  estimation  lor  tlios*'  rases  where  the  extenrlerl  Kalmatt 
lilt('r  (Ux'snot  converge.  Moreov('r  tin'  ik'W  nn'tUod  is  dirc'cliy  appliraUU'  to  U'edUaek 
systems  with  multiple  uonlinearities  and  stoeliastie  di'^i m  Uanres. 


1.  Iiitroiluctioii 


Linear  models  are  often  used  by  engineers  to  siinplif}'  the  analysis  and  design 
process  for  physical  systems,  hut  inevital>ly  uonlin<jaitties  appear  when  I  he  mo<lel  is 
K'tinefl.  and  in  many  rases  a  linear  moflel  is  too  crude  to  be  ot  use.  (.'onsef|uently. 
met  hods  for  working  with  nonlinear  systems  are  always  ot  great  interest. 

For  on-line  system  identification  and  control  it  is  customary  to  use  an  estimator 
to  prov  ide  good  state  estimates  Irom  the  available  noise  corrupted  measurements. 
The  past  (piarler  centurv  has  witnessed  many  approaches  to  this  problem.  I»ul  most 
of  the  approaches  resulted  in  filters  with  stowing  memorv  or  of  infinite  dimension  or 
whit  li  were  simply  cumbersome  (ompiitat ionallv.  Che  extended  Kalman  (ilt*T  (h.KH  ) 
evolved  as  a  general  standarti  ap()roat  li  to  iliis  proltlem. 

.Although  state  estimation  of  linear  systems  has  been  thoroughly  studied  and 
<lo<imiented.  corre.spotidijig  studies  lor  nottlinear  systems  are  still  going  on.  In  the 
ma  jority  of  those  studios  attompts  havo  boon  mado  to  linoarizo  or  quasi-linoarizo  the 
nonlinear  model  so  as  to  apply  the  linear  approaches.  Ramnath  and  Paynter  (1)  used 
the  idea  of  a  scaling  transformation  to  rodttce  the  nonlinear  problem  to  an  obviouslj’ 
solvable  form.  For  e.xample,  using  the  change  of  variables  r  =  /////.  the  nonlinear 
Riccati  equation; 

r-h:*  =  / 


can  be  made  equivalent  to  tin'  Inn  ur 

,1  -  /  ,/  =  It 

Tin'  main  disadvantage'  <if  tliis  nn'tlnxl  is  that  it  i-  inu  ocm  i all v  <  Icai  liow  lti 
construct  ati  appropriate  transfovtnation. 

Statistical  linearization  is  another  approach  that  has  been  proposed  to  <leal  with 
nonlinear  systems.  Crandall  [2]  e\'|)lains  that  the  itlea  is  to  ie|)lace  the  noidinear 

•> 


system  etjuatiou  by  aa  e(|uivaleiit  linear  erjualion  involving  a  parainelei  wliitli  is 
selected  to  minimize  tlie  estimation  error.  Tlie  key  step  is  the  evaluation  ol  statist  ical 
expectations  of  certain  functions  of  the  nonlinear  response,  lint  in  any  <  ase  this 
general  approai  h  only  works  well  tor  small  nonlitieanf ies.  E.vteusions  of  (he  statistical 
linearization  technique  were  given  In-  Hedrick  ami  .\rslan  [ }]  ami  Ueaman  and  Hedrick 

[I]- 

\avin  [•')]  discussed  a  morliliefl  \ersioii  ol  the  discrete  Kalman  lilter  lor  the  non¬ 
linear  case.  The  essential  i<lea  is  to  have  a  nonlinear  transition  matrix  \  for  the 
continttous  case,  develop  the  equivalent  discrete  transition  matrix  using  the  first  two 
terms  of  the  Taylor  series  expansion  of  e\p(  .\t  |  and  complete  the  algorithm  along  the 
lines  of  linear  Kalman  lilter. 

Ksiugihe  sample»l-data  tei  hui<iue  presente<|  In  \'anLamlingham  [ti],  \atiLamling- 
ham  and  Moo.seflU]  and  .Moose  and  Laiizon  J7).  Dliingra  (s;]  later  stmlied  variations 
of  estimators  suggested  by  the  lump  Matrix  simulation  method.  The  .lump  Matrix 
modelling  methorl  has  been  siiccessfullv  used  to  morlel  large  seale  s^•stems.  snrh  as 
the  F.S.  Navy  (  lo.se  in  Wenjinn  St/shiu  (("711. S’)  coiitaitiiiig  as  many  as  nine  major 
non-linearities.  Both  the  simulatio  techni((ue  and  the  residting  noidinear  estijnator 
are  discussed  in  detail  in  subseriuenl  sections. 

Section  2.\  presents  the  mndiuear  sampled-data  ^mudatiou  meihod.  l>>l|o\\ed  b\ 
Section  2i3  in  which  the  nonlinear  evt  imai  i< at  Imi'i"'-  i-  > I-  ’  •  I  1 1-  'u  !  "iv  "- 

s«>me  example  siniulat ioiis  <md  '  "iiip.M I--' -ii -  .  iiliih-  •  i-ti.l.  .1  Idi- i 


2.  A  Sampk'(.l-Data  Siiiiuiari<ni  .MetiiM.I  bn  .':'r;tt<'  E>tiinati(»ii 
of  Xoiiliiiear  Stochastic 


A.  Iiitrotliictioii 


Slaitiug  with  a  basic  state  vaiiable  formulation  for  a  plant  model,  a  nonlinear 
sampleci  data  simulation  method  called  the  .Jump  Matrix  Terlmicpie  (  JMT  )  will  be 
introduced.  The  nonlinear  system  simulation  and  design  of  the  estimator  using  the 
IMT  will  thus  form  a  major  part  of  this  section.  The  complete  develo|)ment  of  the 
technifiue  is  given  in  (7j.(l0j. 

( 'onsifler  a  general  set  of  state  efpiations  deseril>ing  a  linear  system 


.4.r  +  till 

Plant  .\bxlel 

(1) 

(  \v  4-  Dll 

Output  .Model 

(•-’) 

If  the  svstem  input  is  samph'd  at  every  F  secoiuls  ami  held  constant  over  the  time 


interval  T.  equation  we  have 

r(^  +  ,)  =  +  Fidt^,)  (:l) 

//(/*.)  =  (1) 

where  <l>  and  1'  are  tlelined  as  lollows 

<t>  =  =  1  +  AT+A^T^/-2]  +  ---  (•’)) 

■j* 

r  =  /  =  (T  +  T^A/2\  +  T'^A^/V.  +  •  aD.  (6) 


Thus  equation  (J)  defines  the  discrete  I  ime  equivalent  ol  the  lontinuous  time  svs¬ 
tem  given  by  ( la.b).  In  advlit ion.  f  liese  i-sis  h-i  m  t  h.  l.i'-i-  .  •!  -i  -mi  ^iuiulal  iuii 
lor  ( he  linear  time  invariatit  sv-^tem .  \  -  -iiliM-  ."  -•  i  -in  -nnu l.i i  i"n  i  -  -  -  -  usu let e- 1  nest . 
1  he  basis  of  the  .|.\1T  is  l"  'iilil  ili<-  lll•llllll<  .h  -v  -i-  mi  nil"  Inn-.n  .nid  iHinlineai 

siibsv  steins,  usiim,  liel  it  ions  saiMpl'T’- .mhI  '  I.mmiiv,  I  tni  miu  i  In-  -.miiiiI-' nil  n  v  al  /  •-  /  ■- 

I L  +  \  the  linear  subsystem  prop'i';ai<"^  lihi-  -m  -•r-linarv  liu'-ar  --v  -ii-ni.  .\i  the  -.nuplinu, 
instant  t  =  the  noidinearity  output  i'  updafevl  ami  then  lu'ld  -  onstant  over  the 
sampling  perioil.  One  implicit  assumption  in  this  anaivsis  is  that  the  noidinearif ies 
are  memorvles.s.  Over  the  sampling  periorl  the  clamping  of  the  nonlinearitv  output 


4 


acts  as  an  kloal  integration  of  an  input  ronsistine;  ot  a  series  ot  mipnlsc'^.  The  <'lamp 
Ditlpnl  is  then  considere*!  as  an  extra  state  variable  oi  t  lie  system.  1  liiis  the  nonlinear 
'^x  stein  is  modeled  as  linear  system  during  the  sample  inler\al  with  a  nonlinear  jump 
in  the  system  state  at  the  sampling  instant.  I  his  is  illnstratefl  in  I'ie.  1. 

For  a  small  time  interval,  this  heroines  an  accurate  piece  wise  constant  approx¬ 
imation  of  the  actual  nonlinearity.  In  terms  ot  a  set  ot  state  ecpiatioiis.  the  system 
using  the  .JMT  hecomes 

=  «!).)•( /^-^ )  -I-  +  i  |7) 

v{tf)  =  ./(.r(t-)l  f  =  fi.  ('<1 

Here  It.)  is  a  nonlinear  matrix  op<*raior.  acting  on  the  state  \ertor  and 

will  l)e  referrerl  to  as  the  iump  matrix  or  vector.  .\t  the  sampling  instant,  only  the 
nonlinear  state  variahles  are  <  liangtsl  instant Iv  ;  thelitieai  variables  are  kef)!  iDiislaiil. 
which  is  in  direct  contrast  with  keeping  the  nonlinear  state  varialile  constant  over  the 
'ampling  period.  .\s  an  example,  consider  the  s\sstetn  definetl  bv  the  erpiation 

,r -I- •f'^.r -t- .r' i-  =  I)  (!>) 


('hoosing  the  two  nonlinearities  as  slate  varialiles.  the  state  erpialioiis  i  an  he  si'l  up 

as  folows 
r,  =  .r 

I'i  = 

r,  =  rj.r, 
r^  =  rfr.. 

\\  ritini>  it  in  a  mat  rix  form 


^  u 


r  (I  I  n  M 


p  (I  -I  -1 
i)  I)  II  II 
IJ  II  <1  II 


h  I  ■ 


'  -  +  i 


ulieip  at  f  =  fk  dip  nonlinear  mxlate  is  given  hy 


■'Vt) 


■nit:) 

■V2(t:) 

■>-Ut:)-n(i:} 


t  =  tk  (ID 


B.  Estimator  Design 

On  llie  basis  of  tlie  iheoiv  developed  in  Section  2A.  the  nonlinear  stochastic  system 
can  be  modeled  as 


Monlinear  .Jump; 

.vt  =  ./(.<•*-) 

t  =  fk 

Linear  Projection  ; 

=  <I>.r^  +  I’t/t  + 

fk  <t  <  /;  +  , 

Measurement ; 

:t  =  II.vt  +  r, 

f  =tk 

The  system  noise  n't,,  and  lu^'asmemenl  noise  e;  are  statist irally  independent, 
imcorrelated.  gaussiaii  random  noises  with  covariance  matrices  Q  and  U  respect i\elv. 
Intuitively,  the  est  imated  state  just  alter  I  he  sampling  instant  should  be  proporl  ional 
to  the  nonlinear  update  of  state  before  the  sampling  instant  ami  also  (he  measurement 
made  at  the  sampling  instant.  So  it  is  assumed  that  the  lilter  has  the  following  form; 


.It  =  ej{.v:)-\-  (12) 

where  0  and  A  are  weighting  matrices  that  will  b^^  dHtHiiniii^'d  Diuiu'i  dif"  'imalioii 
of  the  sample  interval  T.  Hie  lilier  Im  Ii.iv  -  .1-  .m  .•idiii.ii  ,  [in- ..1  •  -1  im.ii-M  .wit  li  ilie 
intersample  response  giicn  In 

'  A,.=  'h'y'  +  I  "•  ( Idi 

Defining  the  (ilter  error  as  n  +  i  =  ■''t+\  ~  ‘"''I  substituting  the  \;dnes  from  above 

we  have 

fr+i  =  T(.r(i  )  —  A[//./(.i ) -f- (  ^^.1]  (14) 


I 


Taking  the  statistical  expectation 

E[e,^i]  =  (  /  -  AH)E[J{.r^^,)  -  (  /  -  A//)-‘0./(.iT^, ))  ( 1.^) 

For  HU  unl)iase<.l  estimator,  we  want  -h-  — *  .«•*.-  as  k-  —*  cc.  So  if  0  =  (I  — 
AH)  then  as  k-  —*  oo.  £[^*4.1]  — ►  0.  Thus  equation  (f})  reduces  to 

/•;[a+i]  =  ( /  -  A//)/-;[./(.c^^,)  -  ( 10) 

The  filter  equation  (12)  can  now  he  rewritten  as 

i-t  =  •/(.;•,-)- A[-a  HJi-i-;.)]  (IT) 


which  is  similar  to  the  state  estimate  iq)date  equation  ot  the  l^asic  exteiulefl  Kalman 
filter. 

The  problem  now  is  to  evaluate  the  statistical  expertation  of  the  term  [•/(.ri'+i )  ~ 
T(  tT+i)]  )  ^  tiourtnear  operator,  it  cannot  he  ilirectly  interehaMge*!  with 

expectation.  Lauzon.  .Moose  [7],  used  the  idea  of  a  time  varying  matrix  rq  +  i  such 
t  hat 

—  •/(.Ci.^i)]  =  "r+i [•*■*,+1  ~  *\+i]  (It') 

Determining  (ik+i  hy  direct  manipulation  of  the  <le(ini(ion  is  impossible  as  the 
value  of  the  actual  state  is  not  known  at  each  iteration.  \\V>  matrix 

•U  +  \  to  he  the  Jacohimi  iiintri.r  -if  ./(.I  ••',.ihi;il' <1  .n  --m*  m-  .m-  .ilni-  |>■>ml  '  /'4.1 
lying  in  the  closed  interval  [rr^.!.  '”+i|  Au-mi  1  li- im.-i  \  ..I  I-  11..1  d- |jii>-d  v\iilH>Mi  the 
knowlefige  ol  i^^i.  But  sinc<^  the  ■•'-1  imat<'r  i-^  iml'ia-'»^d.  ili>'  mi'  i'.il  ^mailer  as 
k'  — >  :x .  I  heretore  t  he  iiU’ail-' able  |>i  •lilt  ('•  apprt>\mial('<l  h\  1  hi' mtal'-d  value. r~^|. 
Proceefling  with  the  (*st ima t or  devi ■!( ipmeul .  1  lu'  •  •^■p^'ct ei I  va | ue  i >1  1  he  csi  ima t oi'  error 
given  by  equation  (16)  becomes 

+  =  (/  -  A// )./*+, <^£’[u.]  (B)) 


ami  l)y  iDaking  use  of  e<|uatioii  (1  1)  and  ( IS)  the  total  eiior  expiessioit  can  he  uiitten 


as 

n+i  =  (  /  -  A//)«u.+i($ca  +  *«'*,.)  -  At't+i  (2U) 

The  error  covariance  matrix  Pr+i.  defined  as  /Tftt+it J^,),  is  determined  using  equa- 
tion  (20). 


,  =(/- AP)[n,+  ,(<&P,<&^+  'l/(^'i»^)c/J^,](/-  //^A^)4- APA^  (21) 


Deliniug 


(22) 


and 


df/^4.1  —  "t  +  l 


(23) 


Equation  (21)  can  be  written  as 


/\+,  =  (  /  -  A//).)/^t„(/  -  P^'A^')  +  APA''  (24) 

In  order  to  determine  the  optimal  filter  gain,  (he  trace  of  tlie  error  covariance  matrix 
Pt+i  is  minimized.  Differentiating  C  =  tr{Pi+i)  with  respect  to  A  and  setting  the 
result  e((ual  to  zero  yields  the  optimal  gain  matrix 

A-  =  +  fl]-' 

This  expression  looks  verv  similar  thv  Kalman  oain  .  'inati-.n.  >)•  timm;  A'  a-'  tin- 
time  varying  gain  Av+l.  f\  j^\  'an  !•••  v\  rill'  II  in  lli-  iitiiin-.  |i.ini  ,p . 

/Ui  =  I  ^  -  A/n  /  -  //' A' t  +  Ah’A'  (2(.) 


which  (hen  reduces  to  dm  lanidiai  ''X(ii'ssi,.ii 


/u.  =  (  /  -  /u^-, //)-)/*';. 


(27) 


I’his  completes  the  design  of  the  estimator,  bummariziug  the  tiller  equations 


Linear  Prediction: 


.<\+i  =  + 

Nonlinear  Update:  -h  [<k+i(:t+i  - 

Gain  Calculation:  AV+i  =  +  /?]'* 

Covariance  Update:  /\.+|  =  |  /  -  /\\+i //).U^t^.l 

3.  Estimator  Analysis 

A.  Introduction 

In  see.  I  ion  2.  tlie  simulation  <jt  iioiiliueat  syxienis  usin^  I  lie  .Iuin|)  Mat  ri\  Fee  lini<(ue 
(  IM  F)  was  first  e^lisrussed  and  the  tevlmique  use'el  to  dr\r|op  an  estimator  for  nonlinear 
sysle’ius.  In  this  see  tion  (wo  spee  ili<  iioiiliiiear  systeuns  ar<^  e onsideifMl  lor  the  purpose 
<il  illus(raling  (lie  esdiiiator  design.  1  he  lirst  s\stem  eln^sen  represen(s  the  nio(ion 
ol  a  particle  under  nonlinear  damping.  \an  der  Pol's  eriuaUon  can  l>e  used  to  mo<.lel 
such  a  system.  The  .second  e.xample  considers  a  feetlLack  coinlrol  system  i  ontaining 
an  internal  saturating  nonlinearity.  I  he  system  is  inlluenced  by  the  presence  of 
wideband  Gaussian  disturbances  anel  additive  Gaussian  measurement  errors. 

B.  \  aii  der  Pol's  Equation 

The  first  example  to  l)e  (•i)iisid''i<'d  i-  tli'-  •  .innaiinM  ■•(  ..|  ,i  |i,ii  i  ich-  ni-  « 

larni’t  iiinlcr  nonliin'ar  daiupiiu:  a.  ni'"!-  !«d  l-v  X.m  d'  r  r-'l’-'  ■  '|iiai  i' ai.  Ihi'  '•ulution 
ol  many  ui»n linear  e<|uat  ions  can  I  li 'iiiid  le  ■  •  •ti  \  •  i  i  mu  i  h•••l n  mi  >  •  .i  |i  a  m  i  >1  \  a n  der 
Pol's  equation  lorm.  usiiiK  time  scalinu.  normalisation.  '■  alinu  i ransforinai ion  eo  . 

A  form  of  \  aii  der  Pols  equation  dii\>‘ii  l»y  a  gaii.ssiaii  uideliand  input  /ri  / )  i>e 
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written  a.s 


—  f  ( 1  —  .r  ■  ).r  +  '}.r  =  iri  / ) 


Setting  lip  tlie  state  erinations 
.1-1  =  r 

.1-2  =  .i-‘i 

.(•j  =  -t.ci.V2 


>'1 
'■  I 


-  f  .i  -  ri 
0 


CJS) 


Uritiiig  it  in  a  matrix  form 

(I  I  0 1  r  u  ■ 

i- =  ^  1  .(■  +  I  iril)  CJ!)) 

0  I)  I)  J  [  1) 

.r(t+)  =  7(r(/r)|  =  (30) 

.  -'••''if ) . 

I’he  ljfo(.k  diagram  for  llie  system  is  gi\«'n  in  Fig.  J.  The  system  was  simulated  for 
di/rereiit  ealties  off  i.e.  0.3.  aiul  L’.O.  Svsirni  simiiJaliou  and  slate  esiimatioii  was 
carried  out  for  various  values  of  rr~  .  the  position  measun'ment  error  variance  .  and  cr*.. 
the  variance  of  w(t).  Table  shows  the  total  mean  square  error  between  the  estimated 
and  the  real  state  values  for  different  noise  levels  for  the  two  estimators.  Table  I  gives 
a  comparison  of  the  execution  times  for  (he  two  (echniciues.  The  numbers  represent 
time  in  msec  for  1000  iterations.  The  simulations  were  carried  out  on  a  STiV  .3/110 
machine  using  the  Motorola  (i88Sl  float ing  point  coprocessor. 

For  the  various  values  of  system  n*>is<-  |..>\v'i  m  i  In- 

mean  .sf(uare  error  in  the  c.tse  ••(  .(.M  (  l-.si  no.ii- -i  .i .  mn' h  -m.ill.  t  ih.in  |..i  I.KI-, 

Though  Table  siii>e,ests  (hat  I  |\  |-  is  .(lien  lU  m  Oini-  ■  ■  ■<  -  nt  t.  ai  times,  it  uas 
lound  that  the  times  beeonie  <  i anparal >le  as  iln-  a  ..I  ih'  -v  'O-m  mi  leases  |  he 
dilh’rence  is  basicallv  due  lo  Ihe  addiluuial  'i.ii<  \aiiali|es  mi  1 1  mImi  iil  lui  the  umi- 
Tmearities  aiul  hence  ihe  compnl at  ions  in  <  ase  .I.\|  1  estimaioi  are  h>i  a  higher 
orrler  system  than  the  one  userl  in  the  FKF  case.  But  if  performance,  iletinei.l  as  the 
mean-square-error.  is  the  criteria,  the  .JMT  estimator  clearl\’  out  performs  llie  FKF 
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CLAMP 


! 

Exteudctl  Kaliiuxii 
Filter 

.JMT  Estimator 

0.1277 

0.or)829:3 

0.17089 

0.:U474 

O.G7;190 

0.34.024 

1.7192 

1.3-30 

:l.l7196 

1.34143 

T«iblc  1:  Estimator  moan  sqnarr  error.  I'KF  vs  .IMT 


.01 

.01 


1.1 

.01 

1.1 

).l 


Exteiiclrd  Kaliiiaii 
Filter 


0.1 


•).8 


'i.N 


JMT  Estimator 


lo..-) 


.0 

l.l 


.0 

.0 


■  '» 


(i.J 


Table  '2:  Exenitioii  times  on  Sun  ?/l  lb,  EKF  vs  .IMT 


and  shows  uo  teudeucy  for  lifter  divergence  as  is  the  case  in  EKF.  The  iinprovenient 
in  the  estimator  performance  far  outweighs  the  increase  in  execution  time. 

Ail  the  above  simulations  were  carried  out  using  e  =  2.0.  Fig.  3.  shows  tlie 
system  state,  the  EKF  estimate  and  the  .IMT  estimate  for  the  above  system,  c  =  2.0. 
rr*  =  1.0  and  =  0.1  were  the  set  of  parameters  used  for  the  figure.  For  the  initial 
part  both  the  estimated  values  are  the  same  but  gradually  the  .IMT  estimate  tracks 
the  input  with  smaller  error  and  also  with  reducing  time-lag.  The  actual  system 
state  \ariable  values  were  computetl  using  a  dilFerent  program,  instead  of  the  .lump 
simulation  theory,  and  the  same  measurements  were  then  used  for  both  the  est  imators 
in  order  to  have  an  unbia.sed  comparison. 


C.  Noil  Linear  Control  System 

riie  next  example  to  be  considered  is  a  noidinear  belay  Control  System.  Kno 
[O]  has  provided  a  discrete  time  analysis  for  calculation  of  the  unit  step  response 
using  pha.se  plane  trajectories.  Here  in  this  section  a  stochastic  model  of  a  nonlinear 
saturating  amplifier  is  considered.  The  control  system  contains  the  amplifier  followed 
by  a  second  order  Low  I’ass  filter  with  negative  output  feedback.  The  block  diagram 
lor  the  system  is  given  in  Fig.  4.  The  system  is  motleled  as  follows 


■<1  =  .'/ 

(31) 

132) 

r  ■  —  / 1  /■  _  1  ,  1 

(33) 

rin'  stale  <’<|n;if ions  can  now  be  ii|>  m  tin-  m;iiri\'  lorm  ,!■, 


with 


(I  In' 

0  -I  I  r 

(I  11  II 


'■|(''*  I 

./('■(t* )  —  rilt^.  )  -I- 


•/(.rcn) 


I  :U  I 


(3.'-.) 
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Figure  3:  \'a-ii  <'Wm’  Pt/I's  LKF  vs  JMT 
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The  system  simulation  ami  estimation  was  raniecl  out  lor  r=0.0l  ser.  rr;;,  =  |),l 
aiul  (T*  =  1.  The  slope  of  the  saturating  anxplilier  was  kept  at  10.0.  Fig.  o  shows 
the  system  state.  EKF  anti  JMT  .state  estiniates  for  the  .system.  From  the  figure  we 
observe  that  though  the  .JMT  estimate  is  noisy  it  still  travks  the  system  state  fairly 
well,  while  in  the  ca.se  of  EKF  the  estimate  dirfrgts  after  some  time.  This  further 
validates  that  the  JMT  estimator  outperforms  the  EKF  and  also  can  he  iisexl  fur  the 
<  ases  where  the  EKF  estiinater  fails. 

5.  Coiicliisiou 

.A  computationally  efficient  state  estimator  for  nonlinear  systems  has  been  devel¬ 
oped.  The  algorithm  has  been  extensively  tested  and  found  to  provide  excellent  state 
estimates  for  both  open-loo|)  and  closed-loop  .systj'ins  with  mefliiim  to  low  output 
signal-to  noise  ratios  and  stochastic  or  «leterininistic  inputs. 

T)ie  basic  idea  of  the  procedure  involves  a  sanipled-data  approach  and  a  separation 
of  the  linear  and  nonlinear  parts  of  the  system.  During  the  sample  interval,  the 
linear  dynamics  propagate  as  expected,  but  the  nonlinear  component  outputs  are  held 
constant  until  the  end  of  the  sample  interval  at  which  time  they  '’  jump"  to  an  updated 
value.  ( 'onse<(uently,  the  technique  i.s  referrerl  to  as  the  Jump  Matrix  Technique  or 
■IMT'  for  short.  The  updating  of  the  states  at  the  sample  instants  in(  roduces  the  elfect 
of  I  he  nonlinear  dynamics  on  the  system.  The  estimator  is  «le' eloped  on  the  l»asis  of 
the  predictor-corrector  conlicuralion. 

J  he  structure  of  the  estimalor  .d>:"rn|im  iii*  .i  !<■  \  i  \  inn  tiain 

matrix.^/i  +  |.  whicli  repre^f'nis  tin-  nbii  i*>n'liip  ib-  o;ii<  imat  ion  <'ir<ir 

belore  and  alter  tin'  nonlinear  update''.  bi  ell*  f  i.  a  .l.n  obiaii  in.iliix  evaluated 

at  the  one-step  predicterl  state  estimati'.  This  ’'|nasi-lin'-,u i’ation  "  <>\  the  nonliiu'ar 
components  is  a  frequently  used  method  for  approximating  the  nonlinear  effects  with¬ 
out  undue  computational  demands  on  the  algorithm.  With  the  on-line  calculation  of 
the  Jacobian  matrix,  the  estimator  takes  on  the  form  of  a  motlified  Kalman  filter,  but 


l() 


w(t) 


l;  Uluck  ul  (lu^  loulrul  >yslt'ni  willi  ;i  aiiiplilit'c 
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TIME 


is  distinct  from  and  is  shown  to  be  superior  to  the  Extended  Kalman  filter. 

Two  examples  of  stochastic  nonlinear  systems  have  l)een  presented.  The  first  was 
Van  der  Pol's  equation  illnstrating  nonlinear  oscillation.  In  this  ca.se  the  mean  square 
error  of  the  estimate  was  consistantlv  smaller  lor  the  .JMT  estimator,  in  comparison  to 
the  EKF.  The  second  was  a  nonlinear  feedback  control  loop  with  an  internal  saturating 
gain  block,  an  example  of  systems  extremely  difficult  to  handle  by  Extended  Kalman 
filtering  technirines.  The  system  wa.s  driven  by  a  deterministic  signal  modeled  by  a 
unit  step  and  a  wideband  noi.se  flistiirl>ance  w(t).  In  adriition.  Gaussian  measurement 
error  v(t)  was  introduced  to  yield  low  output  signal  to  noise  ratios.  In  this  example 
the  EKF  state  estimate  <liverge<l  from  the  actual  state,  while  the  .JMT  estimator 
provided  a  close  estimate  of  the  system  state.  The  JMT  estimator  performed  very 
well  in  both  examples,  and  in  l)oth  cases  gave  l)etter  results  than  the  EKF. 
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ABSTRACT 

Recently,  a  modal  domain  optical  fiber  sensor  has  been  demonstrated 
as  a  sensor  in  a  control  system  for  vibration  suppression  of  a  flexible 
cantilevered  beam.  This  sensor  responds  to  strain  through  a  mechanical 
attachment  to  the  structure.  Because  this  sensor  is  of  the  interferometric 
type,  the  output  of  the  sensor  has  a  sinusoidal  nonlinearity.  For  small  levels 
of  strain,  the  sensor  can  be  operated  in  its  linear  region.  For  large  levels  of 
strain,  the  detection  electronics  can  be  configured  to  coimt  fringes.  In  both 
these  configurations,  the  sensor  nonlinearity  imposes  some  restrictions  on  the 
performance  of  the  control  system.  In  this  paper  we  investigate  the  effects  of 
these  sensor  nonlinearities  on  the  control  system,  and  identify  the  region  of 
linear  operation  in  terms  of  the  optical  fiber  sensor  parameters. 


1.  INTRODUCTION 


In  this  paper  we  are  concerned  with  the  design  of  active  control 
systems  for  vibration  suppression  that  contain  a  modal  domain  optical  fiber 
sensor.  A  modal  domain  optical  fiber  sensor  consists  of  a  coherent  light 
source  (a  laser),  an  optical  fiber  which  responds  to  a  measurand,  and  a 
detector.  By  knowing  the  relationship  between  the  force  on  the  structure  and 
the  output  of  the  optical  sensor,  this  sensor  can  be  incorporated  into  a 
vibration  suppression  control  system  for  a  flexible  beam  (Cox  and  Lindner, 
1991).  In  order  to  use  such  a  sensor  in  a  control  system,  Cox  and  Lindner 
(1991)  developed  an  appropriate  model  for  control  system  design  that 
incorporated  the  following  elements.  When  the  optical  fiber  is  attached  to  a 
flexible  structure,  ^  applied  to  the  structure  will  induce  a  strain  in  the 
optical  fiber  (Mathews  and  Sirkis,  1991).  (The  relationship  between  the 
stress  distribution  in  the  host  material  and  the  optical  fiber  is  currently  an 
active  area  of  research.)  When  the  optical  waveguide  is  subjected  to  strain, 
the  intensity  of  the  light  at  the  fiber  endface  changes  in  a  predictable  way. 
The  first  model  of  this  effect  was  reported  by  Butter  and  Hocker  (1978).  This 
model  was  developed  further  by  Sirkis  and  Haslach  (1991).  This  later  work 
was  extended  slightly  for  modal  domain  sensors  by  Reichard  and  Lindner 
v!1991).  Last,  the  detection  electronics  (Murphy,  et  al.,  (1991))  are  included  in 
the  model. 

Modal  domain  optical  fiber  sensors  are  of  interest  for  control  system 
design  because  they  can  be  configured  to  have  a  long  gauge  length.  Recently, 
this  class  of  sensors,  called  spatial  filters,  have  been  shown  to  have  certain 
advantages  when  implementing  complex  control  systems  (Lindner,  et  al., 
1990).  They  can  also  be  configured  for  optimal  measurement  of  vibrations  in 
control  systems  for  suppression  of  acoustic  radiation  (Lindner,  et  al.,  1991a, 
1991b).  Reichard  (1991)  developed  models  to  characterize  fabrication  and 
modeling  errors  for  modal  domain  sensor  when  they  are  configured  as  spatial 
filters.  Modal  domain  sensors  have  also  been  demonstrated  in  a  control 
system  for  the  suppression  of  acoustic  radiation  (Clark,  et  al,  1992). 

Optical  fiber  sensors  have  certain  advantages  for  instrumenting 
structural  control  systems.  They  are  low  power,  light  weight,  low  mass  and 
EMI  insensitive.  T^ese  sensor  can  also  be  attached  to  or  embedded  in  a 
structure.  1 

The  model  of  the  sensor  from  strain  in  the  fiber  to  intensity  at  the 
endface  of  the  fiber  (sensor  output)  contains  a  sinusoidal  nonlinearity.  Most 
of  the  results  reported  to  date  operate  this  sensor  in  a  range  of  strain  for 
which  the  ou.tput  is  vu  its  linear  range.  In  ^  articular,  the  analysis,  design, 
and  experimental  verification  by  Cox  and  Lindner  (1991)  of  the  vibration 
suppression  control  system  for  a  cantilevered  beam  emphasized  the  linear 


^  See,  for  example  the  SPIE  Proceedings  on  Fiber  Optic  Smart  Structures  and  Skins,  MV, 
1988-1991. 


region  of  operation  of  the  sensor.  In  this  paper  we  extend  the  analysis  of  the 
peiformance  of  the  control  system  into  the  nonlinear  region  of  the  modal 
domain  optical  fiber  sensor.  In  addition,  we  consider  an  alternative  detection 
scheme  which  extends  the  dynamic  range  of  the  sensor.  For  both 
configurations,  we  investigate  the  existence  of  equilibrium  points  and  limit 
cycles  through  a  parametric  study.  We  also  characterize  the  distortion  at  the 
output  of  the  sensor  introduced  by  the  sensor  nonlinearities.  Using  these 
results,  a  modal  domain  optical  fiber  sensor  can  be  sized  for  a  particular 
application  based  on  the  predicted  disturbance  levels. 

In  Section  2  we  introduce  the  model  of  the  sensor  and  quantify  the 
open  loop  distortion  introduced  by  the  sine  nonlinearity.  i..i  Section  3  the 
effects  of  the  nonlinearities  on  the  closed  loop  system  including  the  stability 
of  the  additional  equilibrium  points  and  limit  cycles  are  investigated.  Section 

4  has  the  conclusions. 

List  of  Symbols 

Optical  fiber  parameters 

a  -  core  radius  of  the  optical  fiber 

ni  -  index  of  refraction  of  the  core 

n2  *  index  of  refraction  of  the  cladding 

X  -  wavelength  of  the  laser 

If  -  intensity  of  the  light  at  the  fiber  endface 

r,  0,  z  -  cylindrical  coordinate  system  for  the  optical  fiber 

zf  -  location  of  the  endface  of  the  fiber 

Pi  >  propagation  constant  of  the  ith  electromagnetic  mode 

r  ‘  phase  of  the  interference  pattern  in  the  optical  fiber 

fi  -  first  order  approximation  of  the  phase  to  strain  induced  by  stress 

f  0  (e  * )  -  Q-point  of  the  sensor 

AP  -  fringe  frequency, 

Lf  -  fnnge  len^,  period  associated  with  the  fringe  frequency 

5  •  path  ofthe  optical  fiber  attachment  to  the  structure 
yMoCt)  •  output  of  the  sensor  in  its  analog  configuration 

yFc(t)  -  output  of  the  sensor  in  its  fringe  counting  configuration 

QpcClt)  -  Quantizer  nonlinearity 

AFo  -  Q-point  drift  of  the  sensor 

Ji  -  ith  order  Bessel  function  of  the  first  kind 

Pi  -  power  in  the  ith  harmonic 

THD  -  total  harmonic  distortion 


Structure's  parameters 

Ei(z,t)  -  strain  tensor  in  the  fiber  at  position  z  at  time  t 
Gz  -  axial  stress  in  the  fiber 

e(Gz)  =  Ed  '  strain  resulting  from  a  unidirectional  stress 
£o*  -  prestrain  in  the  fiber  to  set  the  Q-point 

State  space  models 

T]  -  vector  of  the  first  N  modal  amplitudes 

Tie,  ^  *  equilibrium  points  of  the  closed  loop  system 

Vi  -  ith  mode  shape 

eoi  •  ith  natural  frequency  of  the  structure 

Cl  -  diagonal  matrix  of  the  first  N  natural  frequencies  of  the  structure 
^  -  dair--  -  ith  vibrational  mode 

D  -  diagona.  iiatrix  of  the  damping  factors  of  the  modes  of  the  structure 
Ci  •  "mode  shape"  resulting  from  the  sensor  placement 
B  -  matrix  of  modal  influence  coefiSdents 
Xp  -  state  vector  associated  with  the  structure 

(v(t),  x(t),  y<t))  -  input,  state  and  output,  respectively,  of  the  dynamic 
compensator 

(Ac,  Be,  Cc,  ^c)  -  state  matrices  of  the  dynamic  compensator 
kd  -  closed  loop  gain  of  the  control  system 

ki  >  closed  loop  gain  of  the  control  system  which  results  in  poles  at  ijeoi 

kdm  8x  *  aTa  X unum  closed  loop  gain  of  the  control  system  such  that  the 

system  will  not  have  a  limit  cycle  of  frequency  approximately  cOi 

kc  -  steady  state  gain  of  the  closed  loop  control  system 

P(8)  -  transfer  fimetion  of  the  structure 

C(s)  -  transfer  function  of  the  compensator 

N(A)  -  describing  function  of  a  sinusoidal  nonlinearity 

Npq(A)  -  describing  function  of  a  quantizer  nonlinearity 

Ao  -  amplitude  of  a  limit  cycle 


2.  MODEL  OF  THE  OPTICAL  SENSOR 
2.1  Introdvctioii 

The  results  reported  in  this  paper  were  motivated  by  the  experiment 
shown  in  Figure  2.1  (Cox  and  Lindner,  1991). 


Figure  2.1.  Flexible  Beam  with  a  Modal  Domain  Optical  Fiber  Sensor. 

This  experiment  consisted  of  a  flexible  cantilevered  beam  with  a  piezoelectric 
bending  motor  attached  at  the  root  of  the  beam  and  a  modal  domain  optical 
fiber  sensor  attached  along  the  length  of  the  beam.  The  output  of  the  optical 
fiber  was  used  as  a  feedback  signal  to  damp  vibrations  in  the  beam.  To 
describe  the  effects  of  the  modal  domain  sensor  on  the  performance  of  the 
control  system,  we  require  a  model  of  the  sensor.  In  this  section  we  develop 
that  model  and  show  the  effect  of  the  nonlinearity  on  the  sensor  output. 


2.2  Sensor  Model 

2.2.1  IntrodttCtiQn 

A  modal  domain  optical  fiber  sensor,  shown  in  Figure  2.2  as  a  block 
diagram,  consists  of:  1)  polarized  laser  lig^t  source,  2)  lead-in  optical  fiber,  3) 
fiber  sensing  section,  4)  lead-out  fiber,  and  5)  detection  electronics.  This 
particular  optical  sensor  is  distinguished  from  other  optical  fiber  sensors  in 
that  the  waveguide  parameters  and  source  wavelength  are  chosen  such  that 
two  electromagnetic  modes  propagate  in  the  optical  fiber.  A  modal  domain 
optical  fiber  sensor  was  first  demonstrated  by  Layton  and  Buccaro  (1979). 
Elliptical-core  fibers  were  introduced  by  Kim  et  al  (1987)  to  stabilize  the 
intensity  pattern  at  the  fiber  interface.  The  first  use  of  e-core  fibers  as  strain 
gauges  was  reported  by  Blake,  et.  al  (1987).  Murphy,  et  al  (1990)  introduced 
lead-in  and  lead  out  fibers  to  isolate  the  gauge  length  of  the  sensor  and 
discussed  analog  and  fringe  counting  configurations  of  the  detection 
electronics. 
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output 


Figure  2.2.  Block  Diagram  of  the  Modal  Domain  Optical  Fiber  Sensor. 


The  optical  fiber  sensor  in  Figure  2.1  is  mechanically  attached  to  the 
structure  through  an  adhesive.  (Optical  fiber  sensors  can  also  be  embedded  in 
a  composite.)  When  an  external  load  is  applied  to  the  structure,  stress  is 
transferred  from  the  material  to  the  optical  fiber.  The  stress  in  the  optical 
fiber  induces  strain  in  the  fiber.  The  change  in  strain  in  the  optical  fiber  is 
observed  as  a  change  in  the  intensity  at  the  endface  of  the  fiber.  Based  on 
this  observation,  the  model  of  the  modal  domain  optical  fiber  sensor  that  can 
be  described  in  terms  of  the  following  components: 

1.  The  optical  interrogation  of  the  fiber. 

2.  The  transfer  of  stress  from  the  material  to  the  optical  fiber. 

3.  The  strain-optic  interaction. 

4.  The  detection  electronics. 

In  the  analysis  below  we  make  the  following  assumptions  for  each  of 
the  components  of  the  model  above.  The  coordinate  system  of  the  optical 
waveguide  is  shown  in  Figure  2.3 


Figure  2.3.  Coordinate  System  for  the  Optical  Fiber. 

1.  The  optical  fiber  are  strands  of  glass  configured  to  guide  coherent  light. 

(a)  We  assume  that  the  optical  waveguide  has  a  cylindrical  geometry. 
This  geometry  is  parameterized  by  the  core  radius,  a. 
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(b)  We  assume  that  the  cladding  is  infinitely  thick  so  that  the  weakly 
guiding  assumption  (n^  -  n2  «  1)  holds  (Synder  and  Love,  1983). 
These  assumptions  are  commonly  made  in  the  analysis  of  optical 
fibers.  Using  these  assumptions  for  the  waveguide,  the 
electromagnetic  modes  are  called  LP  modes. 

The  implementations  of  modal  domain  sensors  use  elliptical  core  fibers. 
The  analysis  of  the  electromagnetic  modes  that  propagate  in  these 
waveguides  is  a  currently  topic  of  research. 

2.  A  modal  domain  sensor  measures  strain  in  the  structure  through  the 
mechanical  attachment  of  the  fiber  to  structure. 

(a)  We  assume  the  stress  in  the  host  material  is  the  same  as  the  stress  in 
the  optical  fiber.  In  real  applications  the  model  of  the  stress  transfer 
between  the  host  material  and  the  optical  fiber  depends  on  several 
factors  including  the  geometric  orientation  of  the  fiber  on  the  structure 
and  the  properties  of  the  bonding  layer  between  the  structure  and  the 
optical  fiber. 

(b)  We  assume  that  the  strain  in  the  fiber  is  uniform  and  that  the  change 
in  the  cylindrical  geometry  is  such  that  the  waveguide  assumptions 
above  are  still  valid. 

3.  The  strain  optic  interaction  obviously  involves  the  assumptions  in  (1)  and 
(2)  above  as  well  as  the  following  assumption. 

(a)  We  assume  that  there  is  no  dynamic  interaction  between  the  stress 
waves  in  the  glass  of  the  optical  fiber  and  the  electromagnetic  modes 
propagating  in  the  waveguide. 

4.  A  change  in  strain  in  the  optical  fiber  results  in  a  change  in  the  intensity 
at  the  endface  of  the  optical  fiber. 

(a)  We  assume  that  the  electrical  signal  at  the  output  of  the  detection 
electronics  is  proportional  to  the  intensity  of  the  light  to  a  point  at  the 
endface  of  the  fiber.  The  detection  electronics  for  a  modal  domain 
sensor  actually  integrates  the  intensity  of  part  of  the  endface  oi  the 
fiber  to  increase  the  power  coupling  between  the  photodetector  and  the 
light.  This  configuration  of  the  electronics  does  not  impact  the  analysis 
below. 


When  the  optical  fiber  is  in  an  unstrained  state,  the  fiber  acts  as  a 
waveguide  for  the  light.  The  guided  light  can  be  described  in  terms  of 
eigensolutions  of  the  governing  partial  differential  equation  by  using 


separation  of  variables.  Each  solution,  an  electromagnetic  mode,  that 
propagates  in  the  waveguide  depends  on: 

1.  The  geometry  of  the  waveguide. 

2.  The  indices  of  refraction  of  the  waveguide  ni  and  n2. 

3.  The  laser  source  wavelength,  X,. 

For  modal  domain  optical  fiber  sensors  all  of  these  parameters  are  chosen 
such  that  two  electromagnetic  modes  propagate  in  the  waveguide. 

Using  the  weekly  guiding  assumption,  the  spatial  distribution  of  the 
electric  field  of  the  two  electromagnetic  '  that  propagate  in  the 
waveguide  are  of  the  fcrr' 


E(r,0,2)  =  I  r,e)e-jM,  i  =  0,l.  (2.2.1) 

where  the  consu^xcs  |lo  *  are  the  propagation  cnnatanta  .  The  propagation 
constants  depend  on  the  waveguide  parameters  above.  These  modes  in 
‘^.2.1)  interfere  with  each  other  to  produce  a  intensity  pattern  that  varies 
along  the  length  of  the  xiber.  As  a  result  of  this  interference,  the  intensity,  If, 
at  the  endface  of  the  fiber,  zf,  has  the  functional  form 

If(r,0,Zf)  =  Ii(r,0)  +  l2(r,0)co8(Ap2f +a),  (2.2.2) 


2.2.3  Strain  Transfer 

To  predict  the  change  in  the  sensor  output  with  respect  to  a  force 
applied  to  the  structure  requires  a  model  of  the  strain  transfer  from  the 
stru..  the  optical  fiber.  Our  analysis  below  will  assume  that  the 

functional  relationship  between  the  applied  force  from  the  actuator  and  the 
oin  in  the  optical  fiber  is  known.  This  area  is  topic  of  current  research. 


2.2.4  Strain  Qptic  Interaction 

Let  f^ber  be  attached  to  the  structure  along  a  path  S  on  the 

structure.  Let  e\.s,tj  be  the  strain  tensor  for  the  point  s  e  S  at  time  t  in  the 

cylindrical  coordinate  syt  '  the  optical  fiber.  Let  ei  be  the  normal 
component  of  the  strain  tensor  aligned  with  the  longitudinal  axis  of  the  fiber. 
The  strain  i  the  optical  waveguide  causes  three  of  the  parameters  of  the 
waveguide  change: 

1.  The  length  of  the  optical  fiber  is  changed;  i.e.  Zf  s  z{(e). 


2.  The  core  radius,  a,  is  changed. 

3.  The  indices  of  refraction,  ni,  i  =  1,2,  are  changed  through  the 
photoelastic  effect. 

The  dependence  of  these  three  parameters  on  strain  in  the  optical  fiber  is 
reflected  in  the  intensity  of  the  light  at  the  fiber  interface  (2.2.2)  as 


If(e)  =  Ii(e)  +  l2(e)cos(r(e))  (2.2.3) 

I  where 

•  r(e)  =  JgAp(e)(l+ei)ds.  (2.2.4) 

I  The  quantity  r(e)  is  called  the  phase  of  the  electromagnetic  modes.  It  can  be 
shown  (Reichard  and  Lindner,  1991)  that  for  typical  values  of  optical  fiber 

(parameters  the  first  order  effects  of  the  strain  in  the  optical  fiber  is  on  the 
phase.  Henceforth,  we  assume  that  the  intensity  terms  in  (2.2.3),  Ii.  and  I2 
are  independent  of  strain. 

i  The  functional  dependence  of  the  propagation  constants  on  strain  is 

nonlinear.  Let  e*  be  the  strain  distribution  in  the  optical  fiber  when  the 

■  structure  is  in  equilibrium.  The  analysis  proceeds  by  expanding  the 

in#’A<n*an/t  in  (0.0.  in  o  maI/I 


integrand  in  (2.2.4)  in  a  Taylor  series  about  e*  to  yield 
Ap(e)(l+ei)  =  Ap(^)(l+ej) 
+AP(€*)(l+ei)+£^^^(l+e5)(ej-e5)+h.o.t. 

j*i 

Dropping  the  higher  order  terms,  the  phase  (2.2.4)  can  be  rewritten  as 


(2.2.5) 


f  (e)  =  JgAp(e)(l+ei)ds  =  Jg  AP(€*  )(l+e|)  ds 
+  f  Ap(e* )(l+ei)  +  y  )(£;-£] )ds 

IJS  8ej. 

=  fo(e*)  +  fi(e). 

|||Substituting  (2.2.6)  into  (2.2.3)  we  obtain 
_  If(e)  =  Il  +  l2COS(f(E))  =  Ii  +  I2cos(fo(€*)+f“i(e)). 


(2.2.6) 


(2.2.7) 


The  quantity  fo(£*)  is  called  the  Q-point  The  Q-point  is  set  by  static 

Keformation  in  a  non-sensing  section  of  the  optical  fiber  so  that  (2.2.7) 
ecomes 
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If  (e)  =  Ii  + 12  co8(f  0  (e*)  +  fi(e))  =  Ii  + 12  sin(f  i(e)). 


(2.2.8) 


The  model  of  the  modal  domain  sensor  in  (2.2.8)  depends  on  the  strain 
tensor.  The  results  below  do  not  depend  on  this  tensor  and  they  can  be 

explained  more  simply  by  assuming  the  strain  tensor  e  is  induced  by  a  stress 
distribution  in  the  optical  fiber  that  can  be  modeled  by  single  stress 
component  oo>  i-e. 

e(oo(8))  =  eo(8).  (2.2.9) 

Using  (2.2.9)  the  phase  in  the  last  term  in  (2.2.8)  can  be  written  as 

fi(eo)  =  JgApeo(s)ds  =  Apjg  eo(s)ds.  (2.2.10) 

Sensor  output  can  be  written  as 

If  (t)  =  Ii  +  I2  sin(ApJgeo(s,t)ds )  (2.2.11) 

by  substituting  (2.2.10)  into  (2.2.8).  A  graph  of  the  intensity  at  the  endface  of 
the  fiber  vs.  the  applied  stress  is  shown  in  Figure  2.4. 


Figure  2.4.  Intensity  at  the  Fiber  Endface  vs.  Strain. 

From  (2.2.11)  we  see  that  the  sensor  output  has  a  sinusoidal 

nonlinearity.  The  frequency  of  this  sine  function,  AP,  is  called  the  fiinye 
frcuaencv.  The  period  of  this  sine  function. 


Lf 


(2.2.12) 


is  called  the  fi-inge  length. 


2.2.5  Detection  Electronics 


The  light  at  the  endface  of  the  fiber  is  converted  into  an  electrical 
signal  through  the  use  of  a  photodetector.  The  output  of  the  photodetector  is 
ts^en  to  be  proportional  to  the  intensity  of  the  light  at  a  point  of  the  endface 
of  the  fiber.  The  detection  electronics  use  a  highpass  filter  to  remove  the  DC 
bias.  The  constant  gain  of  the  detection  electronics  is  taken  to  be  one  for 
simplicity. 

Combining  all  of  the  results  above,  the  model  of  the  modal  domain 
sensor  is 


=  sin|ApJgeo(s,t)dsj.  (2.2.13) 

When  the  modal  domain  sensor  is  used  in  its  analog  configuration,  the 
analog  signal  from  the  detection  electronics  (2.2.13)  is  processed  directly. 

The  second  detection  scheme  cotmts  the  number  of  2n  phase  shifts  the 
output  of  the  sensor  experiences.  We  call  this  detection  scheme  fringe 
counting.  To  model  this  detector,  we  define  the  quantizer  nonlinearity  as 

Qpc  (^)  =  nLf ,  if  (n  -  J)Lf  ^  p  <  (n + ^)Lf .  (2.2.14) 

The  output  of  the  detector  is 

ypc(t)  =  Qpc[sin(ApJgeo(s,t)d8)].  (2.2.15) 


2.2.6  Sensor  Parameters 

The  discussion  above  has  identified  the  key  parameters  of  a  modal 
domain  optical  fiber  sensor  as  far  as  the  control  system  is  concerned. 

1.  The  most  important  parameter  is  the  fringe  frequency,  Ap.  For  analog 
detection,  this  parameter  essentially  determines  ^e  dynamic  range  of  the 
sensor  as  can  be  seen  from  Figure  2.4.  For  large  amplitude  strains,  the 
output  of  this  detector  is  dominated  by  the  sine  nonlinearity  in  (2.2.13). 
For  fnnge  counting,  the  fringe  frequency  determines  the  quantization 
levels  of  the  sensor  output. 

2.  The  second  important  parameter  of  this  sensor  is  the  Q-point.  During  the 
operation  of  the  sensor,  the  Q-point  can  drift.  If  we  let  AFo  denote  the  Q- 
point  drift  the  sensor  output  (2.2.13)  is  given  by 
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yj,D(t)  =  sin(Aro  +  Ap^eoCs.Uds). 

In  the  sections  below,  the  effect  of  Q-point  drift  is  also  discussed. 


(2.2.16) 


Consider  a  structure  which  has  been  instrumented  with  a  modal 
domain  optical  fiber  sensor.  If  this  structure  is  subjected  to  a  sinusoidal  force 
such  that  the  sensor  experiences  a  stress  of,  say. 


Oo(s,t)  =  F(s)sino)ot, 


(2.2.17) 


then  the  mathematical  model  implies  that  the  output  of  the  sensor  will  be 


f  Co  ( s, t)ds  =  A  sincoot 

•S 


(2.2.18) 


will  also  be  sinusoidal  with  amplitude  A,  say.  The  relationship  between  F(s) 
and  A,  which  underlies  the  analysis  present^  here,  depends  on  many  factors 
including  location  of  l^e  force,  material  properties  of  the  structure,  fir^uency, 
etc.  In  addition,  the  sensor  we  consider  here  could  be  a  long  gauge  leng^ 
sensor.  If  this  sensor  is  used  in  a  long  gauge  length  configuration,  the  effects 
of  sensor  placement  should  be  consider^  on  a  case  by  case  basis. 

2.3.  Large  Amplitude  Nonlinear  Diatortion 

In  this  subsection  we  consider  the  modal  domain  sensor  with  analog 
detection  electronics  (2.2.13).  For  large  levels  of  strain,  however,  the  sine 
nonlinearity  introduces  distortion  into  die  signal.  In  this  section  we  quantify 
this  distortion. 

Suppose  that  a  sinusoidal  force  on  the  structure  results  in  a  strain  in 
the  optical  fiber  that  is  given  by  (2.2.18).  Substituting  this  expression  for 
strain  into  the  model  of  the  sensor  (2.2.11)  we  get 


yMD(f)  =  sin|Aro  +  A^sino)ot|. 


(2.3.1) 


Obser'^  ig  that  the  sensor  output  is  a  periodic  function,  (2.3.1)  can  be  written 
as  the  Fourier  series 


yMD(t)  =  Jo(  ^aA)sin  ATo  +  5^2Jn(ApA)sin  ATq  cosncoot 

n=2 


2  Jn  (ApA)  COS  ATq  sin  neoot 


(2.3.2) 
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where  Jn  is  an  nth  order  Bessel  function  of  the  first  kind. 

When  «  1  and  the  Q-point  drift  is  zero,  ATq  =  0,  the  power  in  the 
signal  yxoCt)  is  concentrated  in  the  first  term 

»  2Jx(ApA)sina)ot  =  A^sincoot.  (2.3.3) 

With  increasing  amplitude  of  the  strain,  A,  more  power  is  shifted  into  the 
higher  order  harmonics  in  (2.3.2).  Similarly,  Q-point  drift  causes  the 
appearance  of  a  DC  term  as  well  as  to  shift  power  into  the  components  yiiD(t) 
wlfich  are  out  of  phase  with  strain.  This  phenomenon  can  be  quantified  by 
defining  the  power  in  each  harmonic  as 


^2  J„  (  A^A)  sin  ATo  j 
2 


n  even. 


(2.3.4) 


with  a  similar  definition  for  n  odd.  Then  the  total  harmonic  distortion,  THD, 
is  defined  as 


oo 

THD  =  X 

i=2 


|i-xl00% 


(2.3.5) 


The  THD  is  shown  in  Figure  2.5  as  a  function  of  the  Q-point  drift,  ATo,  the 

fringe  frequency,  A^,  and  the  amplitude  of  the  strain,  A.  Figure  2.5 
essentially  describes  the  linear  region  of  a  modal  domain  sensor  when  it  is 
used  to  sense  vibrations  in  a  flexible  structure. 

In  a  fringe  counting  configuration,  this  sensor  will  also  introduce 
distortions  in  the  sensor  output  b^ause  of  the  quantization.  This  effect  has 
long  been  studied  in  the  signal  processing  literature  and  it  will  not  be 
pursued  here. 


CLOSED  LOOP  ANALYSIS 
3.1  Introduction 


In  this  section  we  consider  the  effect  of  the  sine  nonlinearity  of  the 
odal  domain  optical  fiber  sensor  on  the  performance  of  a  vibration 
ippression  control  system.  To  that  end  we  assume  that  the  structure  is 
Libeled  by  N  vibrational  modes  as 

Ti(t)  +  Dfi(t)  +  02Tj(t)  =  Bu(t)  (3,1.1) 

here 


’ni] 

11=  :  ,D=  = 


(3.1.2) 


f  we  assume  that  the  strain  induced  in  the  optical  fiber  by  the  vibrations  of 
he  structure  can  be  expressed  ^  *  i  separation  of  variables  expansion  using 
he  basis  functions 


N 

eo(s,t)  =  y  Vi(s)iii  (t), 


(3.1.3) 


The  state  space  representation  of  the  compensator  is  given  by 


Xc  =  A^Xc+BcVit), 
Ye  =  CcXc  +DcV(t). 


If  the  systems  in  (3.1.6) '  (3.1.7)  are  interconnected  according  to 

u(t)  =  yc(t),  and  v(t)  =  -kdyMD(t),  (3.1.8) 


then  the  resulting  state  space  representation  fi>r  the  closed  loop  system  is 
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(3.1.9) 


The  closed  loop  system  is  shown  in  Figure  3.1. 


Compensator  Feedback  Gain 

Figure  3.1.  Closed  Loop  System. 


In  constructing  the  closed  loop  system  (3.1.9)  we  only  assumed  that  the 
compensator  (3.1.7)  gives  acceptable  closed  loop  performance  with  respect  to 
the  linear  model  (3.1.6).  The  actual  design  of  the  compensator  could  have 
been  carried  out  on  a  reduced  order  model,  say,  a  sul^t  of  the  modes  in 
(3.1.6).  In  that  case  the  models  in  (3.1.6)  and  (3.1.9)  would  also  include 
residual  modes. 

3.2  Equilibrium  Points 

The  sine  nonlinearity  in  the  sensor  introduces  multiple  equilibrium 
points  into  the  linear  design  model.  These  equilibrium  points  are  computed 
by  setting 


(3.2.1) 


Substituting  (3.2.1)  into  (3.1.9)  we  obtain 


0  =  f), 

0  =  +  BCcXce  -  BDgKd  8in(  ApCtie ). 

0  =  AeXea  -  Bckd  sin(  A^Tle  ). 


(3.2.2) 


Assuming  A~^  exists,  the  last  two  equations  in  (3.2.2)  can  be  rewritten  as 

Tig  =  -Q-2B^kckd  sin(  A^Crig ),  (3.2.3) 

where 

ke=Dc-CeA-iBg.  (3.2.4) 

Note  that  (3.2.4)  is  the  steady  state  gain  of  the  compensator. 

The  equilibrium  points  can  be  foimd  by  parameterizing  (3.2.3)  as 

Tig  =  (fl-2Bkckd)Y.  (3.2.5) 

Using  (3.2.5)  in  (3.2.3)  we  see  that  y  should  satisfy 

Y  =  -  sin(  ApCa-2BkckdY).  (3.2.6) 

The  scalar  y  can  be  found  graphically  by  plotting  both  sides  of  (3.2.6)  on  the 
same  graph.  A  typical  graphical  solution  to  (3.2.6)  is  shown  in  Figure  3.2. 


-ain(A^Q~*BkckdY) 


Figure  3.2.  Calculation  of  Equilibrium  Points. 

A  possible  set  of  equilibrium  points  is  also  shown  in  Figure  3.2. 


Based  on  the  analysis  above  we  can  draw  the  following  conclusions. 

1.  The  assumption  that  A  exists  implies  that  the  model  of  the  structure 

does  not  have  any  rigid  body  modes.  Since  modal  dc.aain  sensors 
respond  to  strain  this  assumption  is  justified. 

2.  If  A~^  does  not  exist,  Ac  has  a  zero  eigenvalue  corresponding  to  an 

integrator.  In  this  case  (3.2.2)  may  admit  multiple  solutions  for  Xce. 

3.  If  kc  =  0,  the  compensator  has  a  zero  at  the  origin  which  corresponds  to 

pure  velocity  feedback.  In  this  case  the  nonlinear  system  has  only  one 
equilibrium  point  at  the  origin. 

4.  Suppose  that  in  the  closed  loop  system  (3.1.9),  the  structure  is  modeled  by 

a  single  mode  with  a  colocated  force  input  and  displacement  output. 
Suppose  further  that  the  compensator  is  a  simple  constant  kc  -  Dc  >  0 
and  kd  >  0.  Then  the  sign  convention  we  have  chosen  would  result  in 
increased  stifihess  in  the  closed  loop  system.  If  kd  <  0  the  system 
would  be  unstable  for  large  enough  gains. 

5.  The  number  of  equilibrium  points  is  determined  by  the  frequency  of  the 

sine  function  in  (3.2.6).  From  Figure  3.2  we  see  that  as  tiie  fr^uency 
of  the  sine  function  is  increased,  the  number  of  equilibrium  points  is 
increased.  The  presence  of  multiple  equilibrium  points  can  be 
characterized  in  terms  of  three  fisictors. 

i)  The  factor  is  the  steady  state  gain  of  the  structure’s  transfer 
function.  This  factor  includes  the  placement  of  the  actuator  and 
sensor  through  B  and  C  matrices  as  well  as  the  structure's  modes. 

ii)  The  second  factor  is  kckd.  This  factor  represents  the  steady  state 
gain  supplied  by  the  compensator.  As  this  gain  is  increased,  the 
number  of  equilibrium  point  s  increases  as  expected. 

iii)  The  third  factor  is  the  fnnge  frequency,  Ap,  of  the  modal  domain 
sensor.  Increasing  the  fringe  frequency  decreases  the  dynamic  range 
of  the  sensor,  and  increases  the  number  equilibrium  points. 

6.  The  graph  in  Figure  3.2  assiuned  that  the  product  of  all  of  the  factors  in 

(3.2.6)  was  positive.  If  that  product  is  negative,  then  the  sine  function 
is  shifted  by  180°.  The  results  above  remain  essentially  unchanged. 

7.  If  the  sensor  has  some  Q-point  drift,  then  (3.2.3)  becomes 


T|g  =  -Q-2B^kckd  sin(Aro  +  ApCrie). 


(3.2.7) 
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The  Q-point  drift  introduces  a  phase  shift  into  the  sine  function  in 
Figure  3.2  that  causes  all  of  the  equilibrium  points  to  shift.  In 
particular,  the  equilibrium  point  at  the  origin  is  shifted  to  a  nonzero 
value. 


3.3  Stability  of  die  Equilibrium  Points 

Stability  Analysis 

Next  we  use  Lyapunov's  first  method  to  check  the  stability  of  the 
equiUbrium  points  of  the  closed  loop  system  (3.4.2).  To  this  end  we  linearize 
(3.4.2)  to  obtain 


where 


(3.3.1) 


a2i  =  -d-2  -  cos^A^TiaJkaApBDcC, 
a3i  =  -cos^ApCTieJk^iApBC 
We  call  the  factor 


-kdCOs(ApCTig)  (3.3.3) 

the  effective  loop  gain.  For  the  equilibrium  point  rieo  ==  0,  the  effective  loop 
gain 


-kd  cos(  ApCtieo )  =  -kd  (3.3.4) 

has  its  desired  value.  That  is  to  say  the  poles  of  the  system  linearized  around 
the  equilibrium  point  the  origin  correspond  to  the  closed  loop  poles  that 
resulted  from  the  compensator  designed  using  the  linearized  plant  model.  At 
nonzero  equilibrium  points,  the  effective  loop  gain  has  a  value  which  is 
proportion^  to  the  ilope  of  the  term 

-sin(ApCn-2BkckdY)  (3.3.6) 


at  the  intersections  with  the  45*’  line.  See  Figure  3.2. 


We  can  draw  the  following  conclusions: 

1.  We  assume  that  the  structure's  poles  are  exactly  on  the  imaginary  axis 

(no  natural  damping),  and  that  the  poles  depart  into  the  left  hand 
plane  as  the  compensator  gain  is  increased  from  0  to  ka  as  shown  in 
Figure  3.3.  If  the  sign  of  the  effective  gain  is  reversed,  then  the  poles 
of  the  linearized  system  in  (3.1.5)  will  depart  into  the  right  half  plane 
for  small  values  of  and  the  corresponding  equilibrium  point  are 
likely  be  imstable.  Figure  3.2  shows  that  the  equilibrium  points  are 
likely  alternate  between  stability  and  instability  for  systems  with  no 
natural  damping. 

2.  If  the  structure  has  some  natural  damping,  the  reasoning  in  (1)  still  holds 

qualitatively. 

3.  Suppose  that  the  closed  loop  (3.1.9)  system  (3.3.1)  has  at  least  one 

nonzero  equilibrium  point  corresponding,  say,  to  the  first  intersection 
of  the  curves  in  Figure  3.2  for  positive  y*  At  this  intersection  the  slope 
of  the  sinusoid  (3.3.5)  is  negative.  Also  suppose  that  the  system  has 

enough  damping  so  that  for  some  value  of  the  fringe  frequency, 
this  equilibrium  point  is  stable.  As  the  fringe  firequency  increases, 
reducing  the  dynamic  range,  the  slope  of  the  intersection  of  the  two 
curves  in  Figure  3.2  increases.  As  the  fnnge  frequency  increases  the 
magnitude  of  the  effective  gain  increases.  As  the  fringe  frequency 
increases  to  infinity,  eventually  this  stable  equilibrium  point  wiU 
become  unstable. 

Now  consider  the  next  equilibrium  point  of  increasing  y  in  Figure  3.2. 
As  the  fringe  frequency  increases,  the  slope  of  the  sinusoid  increases 
and  this  equilibrium  point  remains  stable.  Increasing  the  fringe 
frequency  decreases  the  stability  region  of  each  equilibrium  point. 

4.  If  the  sensor  has  some  Q-point  drift,  then  the  effective  gain  becomes 

-kdCOs(Aro  +  ApCqej.  (3.3.6) 

The  Q-point  drift  causes  the  phase  of  the  sinusoid  in  Figure  3.2  to  shift. 
For  Q-point  drifts  larger  than  90°,  an  equilibrium  point  can  change 
from  stable  to  unstable. 


3.4  Limit  Cycles 

Next  we  investigate  the  presence  of  limit  cycles  in  a  control  system 
which  incorporates  a  modal  domain  sensor.  If  the  structure's  transfer 
function  is 


(3.4.1) 


P(8)  =  C(82I  +  D8  +  Q2)~^B 

and  the  compen8ator8  transfer  function  is 

C(s)  =  [Cc(sl- Acf^Bc  +De].  (3.4.2) 

then  the  closed  loop  system  (3.1.9)  is  shown  in  the  block  diagram  in  Figure 
3.1.  We  also  assume  that  the  closed  loop  system  has  only  one  equilibrium 
ooint  at  the  origin. 

To  investigate  the  possible  presence  of  limit  cycles,  we  use  describing 
functions  (Ath  ’*ton,  1975).  We  assume  that  the  limit  cycle  at  the  input  of  the 
nonlinearity  in  ^igur<'  \  has  the  form 

Aocosoot,  (3.4.3) 


and  we  look  for  condi..ons  imder  which  such  a  signal  could  be  supported 
throughout  the  system.  To  that  end,  the  describing  function,  N(A),  for  the 
sinusoidal  nonlinearity  in  Figure  3.1  is 

N(A)  =  ^^^^.  (3.4.4) 

A 


The  graph  of 


N(A)  _  2Ji(AgA) 
Ap  ~  ApA 


(3.4.5) 


is  shown  in  Figure  3.3. 


Figure  3.3.  Describing  Function  of  Sine  Nonlinearity. 

If  this  system  has  a  limit  cycle,  then  a  solution,  (Ao,  (Oo),  to  the  equation 


l+N(A)kdC(jo))P(jfi))  =  0 


(3.4.6) 


should  exist  for  some  Ao  and  o)o-  Solutions  to  (3.4.6)  can  be  found  by  plotting 
the  root  locus  of 


where 


l+k(A)C(s)P(s)=0 


(3.4.7) 


k(A)  =  N(A)kd,forO^A<«..  (3.4.8) 

If  the  root  locus  intersects  the  imaginary  axis  at  coo  for  a  value  of  A  =  Ao,  then 
a  limit  (^de  of  the  form  (3.4.3)  is  predicted. 

To  investigate  the  presence  of  limit  cydes  for  this  system,  we  recast 
the  root  locus  in  (3.4.7)  -  (3.4.8)  in  terms  of  the  root  locus  design  of  the 
original  Unear  system  as  shown  in  Figure  3.4. 


Figure  3.4.  Root  Locus  of  One  Natural  Frequency 
of  the  Structure. 


Rewrite  (3.1.7)  as 


v(t)  =  -kCTi(t).  (3.4.9) 

Then  as  k  varies  from  -  oo  to  «>,  the  poles  of  the  linear  system  trace  out  the 
usual  root  locus  curves.  In  particular,  the  closed  loop  poles  are  given  by  k  s 
kd. 


From  (3.4.8)  we  can  see  that  for  A  =  0,  k  =  k^Ap.  Let  Ao  be  the  value 
of  A  that  solves 
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0. 


(3.4.10) 


Then  this  value  of  A  corresponds  to  k  =  0,  i.e.  the  open  loop  poles  of  the 
structure.  Note  that  larger  values  of  A  correspond  to  a  sign  change  in  the 
compensator  gain  k.  As  A  goes  to  infinity,  the  gain  in  (3.4.8)  oscillates 

between  its  Tnininnim  and  maximum  values,  -0.4d^,  and  respectively. 
Thus,  as  a  function  of  A  the  variation  of  each  pole  of  (3.4.7)  is  a  single  line 
beginning  on  the  positive  root  locus  and  ending  at  the  open  loop  pole  which 

doubles  back  on  itself.  This  behavior  for  the  natural  frequency  (Oi  of  the 
structure  is  shovm  in  Figure  3.5. 


Stable  limit  cycle 

Im 

with  amplitude  Aqj 

Unstable  limit  cycle  / 

with  amplitude  A^jj 

- SIh 

Figure  3.5.  Root  Locus  Of  the  Describing  Function 
of  One  Natural  Frequency  of  the  Structure. 


Let  ki  be  the  gain  in  (3.4.7)  such  that  the  pole  at  joi  is  moved  to  the 
imaginary  axis.  Suppose  that  ki  <  0  as  suggested  in  Figure  3.5.  Next  define 
Amin  as  the  constant  that  minimizes 


imnN(ApA). 

Finally,  define  the  feedback  gain  kdmax  hy 


ki 

apn(apa^„) 


=  k 


dmax* 


(3.4.11) 


(3.4.14) 


For  a  given  structure,  actuator,  and  modal  domain  sensor,  if  the  gain  of  the 
closed  loop  system  is 


kd  >  kdmax 


(3.4.15) 


then  the  closed  loop  system  will  admit  a  limit  cycle  with  a  frequency  of 
approximately  ooi  as  shown  in  Figure  3.5. 

Based  on  the  analysis  above,  we  can  draw  several  conclusions. 

1.  For  the  example  shown  in  Figure  3.5,  if  limit  cycles  exist,  they  are 

approximately  at  the  same  frequency  as  the  natural  frequency  of  the 
structure. 

2.  The  value  of  the  gain  ki,  which  determines  kdmax>  will  be  a  function  of  the 

material  properties  of  the  structure  as  well  as  the  sensor  and  actuator 
placement. 

3.  Ifkd,  satisfies 


0<kd<kd^  (3.4.16) 

then  this  analysis  does  not  predict  any  limit  cycles.  This  range  of  gains 
depends  on  the  fringe  frequency  as  shown  in  (3.4.14).  As  the  firinge 
frequency  increases,  k<]inax  decreases  as  we  would  expect. 

4.  The  amplitude  of  a  limit  cycle  is  determined  firom 

ki  =  N(A^o)kd  (3.4.17) 

which  may  have  multiple  solutions  for  Ao.  For  a  given  solution  which 
satisfies 


A^o  =  constant 


(3.4.18) 


the  amplitude  of  the  limit  cycle  Ao  will  increase  as  the  fringe  firequmicy,  A^, 
decreases. 

5.  Suppose  that  a  closed  loop  system  admits  the  existence  of  multiple  limit 

cycles  with  amplitudes,  Aoi  <  *"  <  Ao(i4-i)*  It  can  be  shown  (Atherton, 
1975)  using  standard  arguments  that  the  limit  cycle  corresponding  to 
the  amplitude  Aqi  is  unstable,  and  the  limit  cycle  corresponding  to  A02 
is  stable.  The  limit  cycles,  ordered  according  to  the  magnitude  of  their 
amplitude,  oscillate  between  stable  and  unstable,  the  first  being 
unstable.  This  analysis  is  applied  to  each  of  the  structure's  natural 
frequencies. 

6,  If  a  closed  loop  system  does  admit  at  least  one  limit  cyde,  a  rough  measure 

of  the  linear  operating  region  could  be  taken  to  be  the  region  inside  the 
limit  cycle  with  smallest  amplitude.  For  a  given  system,  this  region 
could  be  calculated  from  the  analysis  above. 


7.  This  analysis  focused  on  one  gain  for  which  the  root  lod  crossed  the 
imaginary  axis.  This  analysis  could  be  repeated  at  other  gains  if  the 
root  locus  is  more  complicated  than  shown  in  Figure  3.5. 


3.5  Fringe  Coimtiiig 

In  the  previous  subsection  we  analyzed  the  effect  of  a  modal  domain 
optical  fiber  sensor  in  a  feedback  loop  when  it  was  in  its  analog  configuration. 
In  this  subsection  we  consider  a  feedback  loop  with  a  modal  domain  sensor  in 
its  fringe  counting  configuration.  In  this  configuration,  the  model  of  the 
sensor  is  a  quantizer  where  the  quantization  levels  are  the  firinge  length  Lf  as 
shown  in  (2.2.14-15).  The  upper  bound  on  the  sensor  output  is  determined  by 
the  material  properties  of  the  structure  or  the  optical  fiber;  the  strain  level  at 
which  the  material  enters  its  plastic  region.  Another  constraint  is  posed  by 
the  digital  hardware.  Here  we  assiune  that  the  upper  bound  is  infinite. 

The  model  of  the  closed  loop  system  incorporating  an  modal  domain 
sensor  in  a  fnnge  counting  configuration  can  be  obtained  from  (3.1.9)  by 
replacing  the  sine  nonlinearity  by  ^e  quantization  nonlinearity  (2.2.15).  The 
result  is 
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(3.5.1) 


The  equilibrium  points  satisfy 


Y  =  “Qpc(  ATq  +  A^O-2Bkckdy) 

where  the  derivation  of  (3.5.2)  follows  the  derivation  of  (3.2.5-6).  Equation 
(3.5.2)  can  be  solved  graphically  as  shown  in  Figure  3.6. 


(a)  ApC«'2Bk,kd  >  0  (b)  ApCQ-2Bkckd  <  0 


Figure  3.6.  Equilibrium  Points  of  the  Fringe  Counter. 


In  Figure  3.6a  we  assumed  that 

ATo  =  0,  and  ApCQ-2Bkcki  >  0.  (3.5.3) 

In  Figure  3.6a  it  is  dear  that  there  is  only  one  equilibrium  point  at  the  origin. 
It  is  also  easy  to  see  that  Q-point  drift,  if  large  enough,  could  cause  ^t 
equilibrium  point  to  jump  to  a  nonzero  value,  or  have  no  equilibrium  point  at 
all. 


The  stability  of  the  equilibrium  points  can  be  investigated  using  the 
approach  in  Section  3.2.  In  Figure  3.6b  assume  that  the  equilibrium  point 
does  not  correspond  to  one  of  the  jump  discontinuities  of  the  quantizer 
function.  Then  linearizing  the  system  aroimd  this  equilibrium  point  yields 


(3.5.4) 


Thus,  if  the  open  loop  system  with  the  compensator  is  asymptotically  stable, 
each  of  the  equilibriiun  points  vdll  also  be  stable  independent  of  the  fringe 

frequency  A^.. 

To  investigate  the  presence  of  limit  cycles,  the  describing  function 
analysis  in  the  previous  section  can  be  used.  The  describing  function  for  the 
quantization  nonlinearity  in  (2.2.15)  with  Q-point  drift  is 


Nfc(A)  =  0,  0^A<^. 

A 


and 


(3.5.5) 


(3.5.6) 


where 


(3.5.7) 


n  =  2m-l, 

(M'*’  —  +  ATq  <  a  <  (M"*"  +-^jLf  +  ATq, 

(M-  -  J)Lf  -  ATo  <  A  <  (M+  +i)Lf  -  ATq. 

A  plot  of  this  describing  function  for  several  values  of  ATo  is  shown  in  Figure 
3.8. 


Proceeding  as  in  the  last  section,  we  note  that  the  describing  function 
is  always  positive.  Hence,  if  the  root  locus  for  the  linear  system  is  always  in 

the  left  hand  plane  for  0  ^  k  ^  k<i,  then  this  analysis  does  not  predict  any  limit 
cycles  for  the  fringe  counting  configuration  of  a  modal  domain  optical  fiber 
sensor.  Li'  t  cycles  could  occur,  however,  if  the  linear  system  is 
conditionally  ^e. 
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Figure  3.7.  Describing  Function  of  the  Quantizer  Nonlinearity. 


4.  CONCLUSION 


In  this  paper  we  have  considered  vibration  suppression  control  systems 
for  flexible  structures  which  incorporate  a  modal  domain  optical  fiber  sensor. 
We  have  described  the  nonlinearities  of  these  sensors  as  sinusoidal  when  the 
sensor  is  operated  in  its  analog  configuration  and  as  a  q\iantizer  when  the 
sensor  is  configured  for  fnnge  counting.  The  sinusoidal  nonlinearity  can 
introduce  multiple  equilibritun  points  and  limit  cycles.  The  fringe  counting 
configuration  can,  under  some  circumstances,  intn^uce  limit  cycles. 

The  analysis  in  this  paper  was  carried  out  in  terms  of  the  parameters 
of  the  sensor,  the  fringe  frequency  and  the  Q-point  drift.  Given  a  structure 
with  a  modal  domain  optical  fiber  sensor,  these  results  can  be  used  to  predict 
the  nonlinear  behavior  in  terms  of  potential  disturbances,  or  to  size  the 
sensor  to  avoid  undesirable  nonlinear  behavior. 
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Abstract 

A  Gaussian  sum  estimation  algorithm  has  previously  been  developed  to  deal  with 
noise  processes  that  are  nonGaussian.  Inherent  in  this  algorithm  is  a  serious  growing 
memory  problem  that  causes  the  number  of  terms  in  the  Gaussian  sum  to  increaise 
exponentially  at  each  iteration.  A  modified  Gaussian  sum  estimation  adgorithm  is 
developed  here  that  avoids  the  growing  memory  problem  of  the  previous  algorithm 
while  providing  effective  state  estimation.  A  simulation  example  is  presented  which 
illustrates  the  new  nonGaussian  estimation  technique. 

I.  INTRODUCTION 

Although  the  Gaussian  assumption  for  modeling  many  types  of  signals  md  noise 
processes  is  valid  in  a  wide  range  of  applications,  in  practice  it  may  not  be  a  good 
assumption  for  some  signals.  Estimation  techniques  therefore  need  to  be  developed  that 
can  cope  effectively  with  the  nonGaussian  nature  of  certain  signals.  One  such  approach 
previously  developed  by  Sorenson  and  Alspach  [1]  is  the  Gaussian  sum  technique.  The 
density  function  of  each  nonGaussian  process  of  the  system  is  approximated  by  a 

This  work  was  supported  by  the  Office  of  Naval  Research,  grant  N00014-89-J-3123. 
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weighted  sum  of  Gaussian  density  functions.  The  conditional  density  of  the  state  given 
the  available  measurement  sequence,  necessary  in  the  Kalman  filter  development,  is 
updated  using  the  Gaussian  sum  approximations  and  Bayes’  rule. 

A  serious  limitation  in  the  approach  used  by  Sorenson  and  Alspach  is  that  the 
number  of  Gaussi£in  terms  used  to  approximate  the  density  functions  increases  at  each 
time  iteration.  An  alternate  approach  is  required  to  alleviate  this  limitation.  A 
modified  estimation  algorithm  is  developed  here  based  on  an  adaptive  Kalman  filter 
scheme  first  presented  by  Magill  [2],  and  extended  by  Moose  [3].  Essentially,  a 
parameter  vector  is  used  to  uniquely  describe  each  Gaussiam  term  in  the  estimator.  The 
parameter  vector  is  restricted  to  be  randomly  chosen  from  the  same  finite  set  of  known 
values  at  each  iteration.  By  using  a  nonGaussian  signal  model  in  conjunction  with  the 
modified  formulation  of  the  Gaussian  sum  estimator,  the  number  of  Gaussian  terms  at 
each  iteration  of  the  estimator  will  be  fixed,  thereby  avoiding  the  growing  memory 
problem. 

The  nonGaussian  signal  model  and  associated  Gaussian  sum  density  approximation 
is  developed  in  section  II.  The  modified  estimation  algorithm,  termed  the  modified 
Gaussian  sum  (MGS)  adaptive  filter  [4],  is  developed  in  section  III.  A  simulation 
example  implementing  the  MGS  adaptive  filter  is  presented  in  section  IV.  The 
conclusions  are  given  in  section  V. 

II.  NONGAUSSIAN  SIGNAL  MODEL  DEVELOPMENT 

Let  u  be  a  r2mdom  noise  process  or  raindom  input  signal  with  a  nonGaussian 
density  function.  It  can  be  modeled  as  the  sum  of  two  statistically  independent  random 
processes 

u  —  b  +  n  (1) 
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The  first  term,  b,  is  a  semi-Markov  process  with  state  transitions  governed  by  the 
transition  probability  matrix  of  a  conventional  Markov  process.  Mau'kov  processes  have 
the  property  that  a  transition  is  made  at  every  time  instant.  The  transition  may  return 
the  process  to  the  state  it  previously  occupied,  but  a  transition  occurs  nevertheless. 
However,  in  the  semi-Majkov  case,  the  amount  of  time  between  transitions  is  a  random 
variable  [5].  The  value  of  b  is  randomly  selected  from  a  fixed  set  of  discrete  values, 
characterized  by  a  delta  probability  density  function 

M 

p{b)=^P,S(b-b,)  (2) 

i=l 

with 


M 

£;p,  =  l;  Pi>0  fori  =  1,2,..., A/  (3) 

i=l 

This  process  can  be  thought  of  as  a  randomly-switching  bias,  each  bias  value  6,-  having 
probability  P,. 

The  second  term,  n,  is  a  zero  mean  white  Gaussiain  process  with  variance  a\.  With 
both  densities  known,  the  density  function  of  u  can  be  found  using  the  convolution 
relationship  between  u,  n,  and  b  [6] 


P(w)=  f  Pn(»-^)PbWdb 

J  —00 


(4) 


where  p„(u  —  b)  is  the  Gaussian  density  with  n  =  u  —  b 


Pn(w  -  &) 


(5) 
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Substituting  (2)  and  (5)  into  the  convolution  integrcil  (4)  gives 

Interchanging  integration  and  summation 

M  [foo  , 

p(t<)=5:p,  1  e  (7) 

fei  L'*  -«>N27ra„  J 

Using  the  sifting  property  of  the  delta  function  [7] 

/(a)=  f  f{x)S{x  -  a)dx  (8) 

J  —00 

the  integral  of  (7)  is  evaluated  as 

or 

P(«)=  (10) 

«=i 

Thus,  the  nonGaussian  density  function  of  u  can  be  modeled  as  a  Gaussian  sum.  The 
weight  P,  of  each  Gaussian  term  is  the  probability  of  the  bias  term.  The  bias  term 
bi  is  restricted  to  be  randomly  selected  from  the  same  fixed  set  of  bias  v2Jues  at  each 
iteration.  Using  this  model  in  conjunction  with  the  modified  Gaussian  sum  adaptive 
filter  developed  in  the  next  section  avoids  the  growing  memory  problem  of  Sorenson  and 
Alspach’s  development. 
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Selecting  the  parameters  P„  6j,  and  <t„  in  (10)  to  obtain  the  “best”  approximation 
Pqs  to  some  actual  nonGaussian  density  function  is  accomplished  by  means  of 
minimizing  the  X*'  norm 


WPa 


,  f  oo  M 

-  Pgs  11  =  I  Pa{^)  -  E  al] 

-  oo  ,=1 


du 


(11) 


This  curve  fitting  exercise  can  be  done  off-line  using  several  values  of  M  until  a  suitable 
trade-off  between  minimum  norm  and  minimum  M  is  obtained.  Sorenson  and  Alspach 
[1]  performed  this  curve  fitting  procedure  using  and  norms  for  a  uniform  density 
and  a  G2unma  density.  It  W2is  found  that  minimizing  the  X*  norm  resulted  in  many 
fewer  terms  in  the  Gaussian  sum  and  a  considerably  better  looking  approximation  for 
both  densities  compared  to  minimizing  the  1}  norm. 

Fig.  1  compares  a  Gamma  density  with  a  four-term  Gaussian  sum  density 
approximation  minimizing  the  X^  norm.  The  Gaussian  sum  curve  is  shown  to  fit  the 
Gamma  curve  reasonably  well.  The  Gamma  density  used  is 


p( 


u)  =1 
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u  >  0 
u  <  0 


(12) 


Each  term  of  the  Gaussian  sum  has  a  fixed  value  of  o-„  =  1.  Table  I  lists  the  values  of 
Pi  and  hi  used  in  the  Gaussian  sum. 


TABLE  I. 

Gaussian  sum  P,-,  values,  =  1. 
^  Pi  bj 

1  0.081  2.537 

2  0.432  2.553 

3  0.356  4.555 

4  0.131  6.933 
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III.  MODIFIED  GAUSSIAN  SUM  ADAPTIVE  FILTER 

A  modified  Gaussian  sum  adaptive  filter  is  now  developed  for  a  linear  system  with 
a  deterministic  input  signal,  nonGaussian  ^  .ant  noise,  and  nonGaussian  measurement 
noise.  The  system  is  modeled  in  standard  discrete-time  state-space  form  as 


^k+i  =  ^^k  +  Fufc  -f 

(13) 

Wk  =  ak  +  rrik 

(14) 

2k  =  HXk  +  Vk 

(15) 

Vk  =  bk  +  Ufc 

(16) 

where  is  the  state  vector 

U/f  is  a  known  deterministic  input 

lojt  is  the  vector  Gaussian  sum  signal  model  of  the  actual 

nonGaussian  plant  noise  process,  comprised  of  semi-Markov 
bias  vector  0^,  and  zero  mean  white  Gaussian  noise  m*.  with 
covariance  Q 

Zjf  is  the  measurement  vector 

Vf,  is  the  vector  Gaussi^ul  sum  signal  model  of  the  actual 

nonGaussian  measurement  noise  process,  comprised  of  semi- 
Markov  bias  vector  6*,  and  zero  mean  white  Gaussian  noise 
with  covariance  R 

r,  H  are  the  respective  constant  transition  matrices 

amd  the  random  quantities  ^ki  aissumed  to  be  mutually  statistically 

independent. 

The  optimal  estimate  of  the  state  vector  is  found  by  minimizing  the  meam- 
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square  error  between  and  This  results  in  the  conditional  mean  estimate 


^Jk+l  ~  I  ^fc+ll  —  ^ik+lP(^<:+l  I  ^Jk+l)  ^^k+1  (17) 

J  — oo 

where  ^*+1  is  the  current  measurement  sequence  conditionad 

density  function  of  (17)  can  be  written  as  the  ratio  of  the  corresponding  joint  aind 


marginal  densities 


_/_  I  '5'  \  _  P(^fc+l5‘^fc+l) 


The  two  bias  vectors  a  and  b  are  explicitly  brought  into  (18)  by  considering  the  joint 
density  Pi^k+ii^k+i)  lo  be  ^  maurginal  density  found  from 
P{^k+ii^k+v^k+i  —  ^ii^k+i  =  ^j)  by  summing  over  the  a  and  b  terms 


iV  M 

^2  P(^A;-l-l>  ^k+h  °fc+l  =  ==  ^j) 

I  Zm)  =  - 


Using  Bayes’  rule  and  using  p{xk+i,Zk+i,ai,bj)  as  shorthamd  for 
Pi^k+iT^k+ii^k+i  =  =  ^j)i  ^be  conditional  density  of  (19)  becomes 


N  M 


P{xk+i  I  ^fc+i)  =  S  2  Pi^k+1 1  Zk+i, Oi, bj)p{ai, bj  I  Zk+i)  (20) 

i=i  i=i 


Substituting  (20)  into  the  conditional  mean  (17),  and  interchanging  integration  and 
summation  results  in 


^  Af  r  i-oo  -j 

®<:+l  ~  ^fe+lP(^fc+l  I  P(o«)^j  I  ^fe+l) 

1=1  j=l  L-*  -<» 
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The  bracketed  integral  in  (21)  is  the  conditional  mean  estimate  of  given  that 
Ofc+i  =  <*i  ^ife+i  =  denoted  by 

^le+\~  I  ®fc4-lP(^fc+l  I  (22) 

J  —  OO 

In  effect,  xj^+i  represents  the  estimate  for  the  ij*^  density  combination  from  the  two 
Gaussian  sums.  A  fixed  set  of  N  x  M  (NM)  estimators  is  needed  to  provide  all  of  the 
individual  estimates.  The  overall  estimate  from  (21)  and  (22), 

N  M 

^k+1  ~  ^  I  ^k+l)  (23) 

i=l  i=l 

is  a  weighted  sum  of  the  NM  individual  estimates.  The  weighting  factor  p(a,-,6j  |  Z^^i) 
is  the  probability  that  and  6^4.1  =  bj  given  the  current  measurement  sequence 

Zjfc4i.  Since  the  number  of  terms  in  the  overall  estimate  (23)  is  fixed,  the  growing 
memory  problem  of  [1]  is  avoided. 

Each  estimate  (22)  is  found  from  a  Kalman  filter  equation,  modified  [4]  to  include 
the  two  bias  terms  and  bj  of  the  Gaussian  sum  densities,  given  by 

2fc+i  =  +  ^ai  +  Kfc4i[z;(,4i  -bj-  +  Tuk  +  ’^0^)]  (24) 

The  Kalman  gain  and  covariance  terms 

+  (25) 

=  +  (26) 
■^fc+1  —  ~  ^^k+l^)^^k+l  (27) 
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axe  the  same  for  each  estimate  (24)  because  the  covariances  Q  and  R  remain  fixed 
for  each  respective  bias  parameter  and 

The  weighting  term  p(a„6,  |  of  (24)  is  found  next.  Using 
and  Bayes’  rule,  the  weighting  term  becomes 


P(a.-.&i  I  Zk+i)  = 


Pj^k+i  I  ah^j^Zk)piai,bj  I  Zk) 

Pi^k+i  I  Zk) 


(28) 


The  first  term  of  the  numerator  of  (28)  can  be  approximated  by  a  Gaussian  density 
if  the  bias  terms  switch  slowly  compared  to  the  time  interval  k  [3],  given  by 


p{zk+i  1  bj,  Zk)  =  N[ff  ($xi>  +  Tuk  +  ^a,)  +  HMk^.H'^  +  R\  (29) 


with  mean  and  covaxiance  terms  avjdlable  &om  the  Kalman  filter  equation  (24)  and 
Kalman  gain  (26). 

The  second  term  of  the  numerator  of  (28)  is  the  predicted  probability  value  that 
°Jfc+i  ==  ^k+\  —  bj  given  the  past  measurement  sequence  Zk,  given  by 

N  M 

p(<‘k»  =  «i,  =  h  I  z»)  =  E  E  (30) 

0=1  /Jsl 

where 

C  =  P(afc+i  =  a.- 1  Ofc  =  Oo)  (31) 

Bi^  =  p{bk^,-^bi\bk^b^)  (32) 

«'?^  =  p(afc  =  aa,^fc  =  &/3l^*)  (33) 

The  terms  are  Markov  transition  probabilities  [5];  that  is,  9'°  is  the 

conditional  probability  that  a  =  a,-  at  time  interval  k+1,  given  that  a  =  a^  at  time 
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interval  k.  The  61^  term  is  similarly  defined.  The  term  is  of  the  same  form  as  (28) 
and  is  just  the  previous  weighting  term  at  the  previous  time  interval  k. 

The  denominator  term  of  (28)  is  independent  of  ij.  Therefore  it  is  the  same  for 
each  i  estimator  and  becomes  a  scale  factor. 

Combining  (29)  -  (33),  the  weighting  term  (28)  is  written  as  [4] 

"oiVi  =  AOi.l’i  I  Zti-i)  =  I  (34) 

0=1  ;3=1 

where  is  a  scale  factor  determined  at  each  time  intervad  such  that 

/V  M 

EE^iVi  =  i  (35) 

i=i  j=i 

guairan teeing  that  the  sum  of  all  the  weighting  terms  (34)  is  equal  to  one. 

The  structure  of  the  overall  MGS  adaptive  filter  is  a  fixed  bank  of  NM  Kalman 
filters  operating  in  parallel,  with  each  individual  estimate  multiplied  by  its  own 
>.01  responding  weighting  term.  The  ij*^  estimator  based  on  the  bias  terms  that  most 
closely  matches  the  actual  bias  terms  of  the  modeled  system  will  have  a  corresponding 
weighting  term  that  tends  closer  to  one,  while  the  weights  of  the  other  mismatched 
estimators  will  tend  towards  zero.  A  block  diagram  of  the  MGS  adaptive  filter  is  shown 
in  Fig.  2. 

IV.  SIMULATION  EXAMPLE 

An  example  illustrating  the  modified  Gaussian  sum  estimation  technique  is  now 
presented.  A  first-order  system  is  used,  modeled  by  the  following  discrete-time 
equations 
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Xk+I  =  e"®^Xfc  +  (1  -  e-**^)^*  +  (36) 

tUfc  =  Ofc  +  mfc  (37) 

Zfc  =  Xfc  +  Ufc  (38) 

Vk  =  h  +  «fc  (39) 


The  value  of  a  is  0.6  2uid  the  sample  time  T  =  1  second.  For  simplicity,  let  the 
plant  noise  (37)  be  zero  mean  white  Gaussian  with  variance  Q  =  1.0.  Therefore,  the 
randomly-switching  plant  bias  term  takes  on  the  vtilue  of  a*.  =  0.  The  MGS  adaptive 
filter  structure  now  reduces  to  a  bank  of  only  M  filters  operating  in  parallel.  The  actual 
measurement  noise,  modeled  by  (39),  has  the  Gamma  density  of  (12),  with  a  mean  and 
vari2Lnce  of  4.  The  measurement  bias  term  6^  can  be  randomly  selected  firom  the  last 
three  bias  terms  of  TABLE  I,  {2.553,  4.555,  6.933}.  The  first  bias  term  of  TABLE  I, 
{2.537},  is  not  used  since  it  is  so  close  in  value  to  the  second  bias  term.  The 
measurement  model  noise  term  n*  is  zero  mean  white  Gaussian  with  variance  jR  =  1.0. 
A  deterministic  input  of  =  10  is  used  throughout  the  simulation.  Fig.  3a  shows  the 
measurement  and  state  sequences.  Note  how  the  measurement  is  centered  about  14, 
indicating  a  mean  value  for  the  Gamma  density  of  4. 

The  filter  is  initialized  with  equally-valued  weighting  terms 


uii  =  j,  for  i=  1,2,3 


(40) 


A  Markov  transition  probability  matrix  0^^,  consisting  of  $1^  elements,  is  configured 
with  a  high  probability  that  the  bias  term  does  not  switch  from  one  value  to  another, 
and  a  low  probability  that  the  bias  term  does  switch,  given  by 


.95  .025  .025 
.025  .95  .025 
.025  .025  .95 


(41) 
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The  initial  value  of  the  state  and  state  estimate  is  Xq  =  Iq  =  20,  and  the  initial 
value  of  the  variance  of  the  error  is  =  100.  The  overall  state  estimate 

and  the  state  are  shown  in  Fig.  3b,  with  the  error  and  overall  state  estimate  shown  in 
Fig.  3c.  Note  how  the  error  appears  to  be  zero  meem,  thus  showing  that  the  MGS 
a  daptive  filter  removes  the  bias  effect  of  the  nonzero  me2ui  Gamma  measurement  noise. 

Figs.  4a,  b,  and  c  show  the  weighting  terms  for  each  of  the  bias  terms.  In  order 
to  lessen  the  noise  of  the  weighting  terms,  a  first-order  lowpaiss  filter 

wi^i  =  Awi  +  (1-  A)wi^j  (42) 

is  used  to  smooth  the  weighting  terms,  where  A  =  0.7.  Fig.  4d  can  be  thought  of  as  the 
overall  measurement  bias  estimate  6*  due  to  the  nonzero  mecin  Gamma  mezisurement 
noise.  Using  (24)  and  (34),  this  overall  bias  estimate  is  part  of  the  overall  state 
estimate  (23)  and  is  written  in  this  case  with  a*  =  0  as 

(43) 

i=i 

Note  how  this  overall  bias  estimate  approximately  models  the  mean  value  of  4  of  the 
Gamma  measurement  noise. 

V.  CONCLUSIONS 

An  estimation  technique  has  been  developed  which  can  cope  effectively  with 
nonGaussian  signals.  This  MGS  adaptive  filter  is  comprised  of  a  fixed  set  of  estimators 
operating  in  parallel  with  each  individual  estimate  possessing  its  own  corresponding 
weighting  term.  The  MGS  adaptive  filter  is  suitable  for  practical  implementation  since 
it  avoids  the  growing  memory  problem  of  a  previously  developed  algorithm. 
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Abstract 

This  paper  describes  the  use  of  distributed  filtering  to 
realize  arbitrary  scalar  system  outputs  from  the  output  of  a 
distributed  sensor.  The  set  of  scalar  position  measurements 
realizable  using  traditional  point  sensors  is  a  subset  of  the  set  of 
scalar  outputs  realizable  using  distributed  sensing  and  filtering. 
This  is  one  of  the  advantages  of  distributed  sensing  over  point 
sensing.  Several  examples  are  provided  to  illustrate 
applications  of  dhtributed  filtering.  It  is  shown  that 
distributed-effert  sensors  can  be  used  to  implement  functional 
observers,  providing  a  significant  reduction  in  compensator 
order. 

1.  IntrodMction  . 

Sensor  placement  continues  to  be  an  important  issue  in 
the  control  of  flexible  structures.  Restrictions  on  sensor 
placement  often  result  in  limitations  on  the  set  of  realizable 
system  outputs  or  measurements  -  particularly  when  the 
outputs  or  measurements  are  produced  by  point  sensors  located 
on  the  structure. 

Distributed  sensors  provide  an  attractive  solution  to 
many  of  the  problems  associated  with  the  use  of  point  sensors 
in  flexible  structure  control.  Distributed  sensors  produce  a 
spatially  continuous  measurement  of  the  system  behavior. 
Most  control  systems,  however,  employ  scalar  signal  processing 
and  computing  technology;  the^ore,  distributed  sensor  outputs 
are  usually  converted  into  scalar  sigBals  before  they  are  made 
available  to  the  control  system.  The  challenge  is  to  retain  in 
the  scalar  signal,  as  much  of  the  information  contained  in  the 
distributed  signal  as  possible.  The  process  of  converting  the 
distributed  measurement  into  a  scalar  signal  is  essentially 
distributed  filtering. 

The  work  presented  in  this  paper  provides  the 
theoretical  foundation  for  the  use  of  distributed  sensing  and 
filtering  in  a  variety  of  control  applications.  This  technology  is 
already  beii^  wpliM  experimentimv,  particularly  in  the  area  of 
modal  filter!^  [4,3].  The  idea  of  filtering  distributed  signsls  to 
provide  scalar  sensor  outputs  also  ^^es  to  the  design  of 
distributed-eSect  sensors  •  sensors  that  produce  a  scalar  output 
that  is  based  on  a  distribnted  measurement  [2]. 

In  this  papa,  we  consida  the  ^plication  of  distributed 
sensing  and  filtoing  to  the  class  <d  one-dimensional  flexible 
stmetura  represented  by  flexible  beams,  and  it  is  assumed  that 
there  are  no  restrictions  on  the  filta  functions  that  may  be 
implemented,  hi  section  2  we  describe  mathematical  modds 
for  t^  claM  of  systems  and  sensors  considered  in  this  papa.  In 
the  third  section,  we  describe  the  filtering  process  and  show  the 
existence  of  filta  functions  for  realizing  any  arbitrary  scaUr 
system  output  consisting  of  a  linea  combination  of  the  system 
position  stata.  In  section  4,  we  present  several  exampla  of 
distributed  filtering  wplications,  i^uding  the  daign  of  low- 
orda  compensators.  ^Aie  final  sectian  contains  a  summary  of 
the  results  presented  in  this  papa  and  describes  some  areas  for 
future  reseuch. 


2.  Sntsimil  afiBflsc  mflddt 

In  this  section  we  describe  the  mathematical  model  for 
the  sensor  and  a  mathematical  model  fot  the  class  of  flexible 
structura  considered  in  this  papa.  Since  we  are  concerned 
with  the  application  of  distributed  sensing  to  the  class  of 
flexible  structura  subject  to  bending  and  vibrations  in  one 
dimension,  we  consida  as  a  representative  example  the  model 
for  a  flexible  beam. 

2.1  System  model 

Let  y(x,t)  denote  the  deflection  of  the  flexible  structure 
from  eqnilibnnm  (defined  as  y(x.t)^(f)  at  the  point  s  in  dte 
domain  of  the  structure,  D(.'\f):w{zJ  0<  s<  2],  at  time  t  Fa 
zeD(^),  deacriba  the  ms^  at  the  structure  a  a  function 
of  time.  We  shall  assume  then  exists  a  set  of  functians  {dj(*)} 
which  form  an  orthonormal  basis  for  Ln^l),  the  set  of  w 
square-integrable  functions  defined  on  DCV)  Im.  At  any  time 
t^,  the  set  of  all  structural  sbapa  y(x,t)  is  cootained  in 
therefore,  given  anv  ^}u^}e  yfx,i),  thm  exists  a  act  of 
time  varying  weights  such  th^ 

(1) 

The  basis  functions  {^j(x)}  are  commonly  refered  to  s.'  ' 
mode  shapa  of  the  structure,  and  the  time  varying  : 

{"Ijft)}  called  the  modal  amplituda  or  modal  weight 

While  y{x,tj  in  ^1)  is  ea^resaed  m  an  infinite  sum 
mode  shapa,  we  typically  assunw  that  y(*,t)  can  be 
reasonably  approximate  by  a  finite  numba  of  nuMa,  so  that 

(2) 

J=l 

Fot  models  of  flexible  structura,  the  uppa  limit  N  in  (2)  may 
be  very  large. 

Given  the  basis  {d,-(*)}>  the  modal  amplituda  (irXt)} 
completely  describe  the  shape  of  the  structure,  y(*,t).  Ia''tM 
modal  basis,  one  mathematiol  model  of  the  flexible  s^cture  is 
described  by  a  second-orda  adinary  differential  equation  of  the 
form 

mt)+KT,(ihBx(t),  (3) 

where 

»KfH  (*) 

The  opaators  M,  K,  and  B  in  (3)  are  known  u  the  maa, 
stiSnea,  and  input  influence  matnea  respectively,  and  x(t) 
denota  external  forca  (control  a  disturbance)  whi<^  may  act 
on  the  structure. 


Let  *(t)  denote  a  sjrttem  output  or  meaeuremoit. 
System  poeitioa  outputs  are  modelled  as  linsar  combinations 
t£e  system  position  states  in  (3)  -  that  is,  there  exists  an  output 
or  measurement  matrix  C  of  ^>propriate  dimensions  such  that 

(5) 

In  the  special  case  where  x(t)  is  a  position  measurement  that 
corresponds  to  the  output  of  a  point  sensor  located  at  a  point 
on  the  structure,  x(t)=y(xQ,t)  and  the  output  matrix  C  in 
(5)  IS  given  by 

C=(^l(*h)  (6) 

where  dj(ah)  denotes  the  value  of  the  j-ih  mode  shape  evaluated 
at  the  point  vsxq. 

2.2  Sensor  and  filter 

The  output  of  an  ideal  distributed  position  sensor  is 
simply  the  structural  sh^>e  y(x,t).  An  example  of  such  a  sensor 
is  the  Olographic  sensor  described  in  [1,2].  For  structural 
displacements  within  the  linear  range  of  the  sensor,  the  spatial 
variation  in  the  intensity  of  the  optical  signal  is  proportional  to 
the  shape  od  the  structure  at  rime  t  [Ij. 

To  convert  the  iniormation  provided  by  the  distributed 
sensor  into  a  form  that  can  be  processed  by  the  control  system, 
one  or  mote  scalar  signals,  e(t),  are  formO  by  performing  the 
spatial  filtering  operation 

y(x,t)  dx.  (10) 

We  call  Kfx)  the  filter  transmittance  or  weighting  function,  and 
refer  to  e(i)  as  the  filtered  output  of  the  distributed  sensor.  In 
the  case  of  the  hologr^hic  sensor  this  is  accomplished  by 
passing  the  output  through  an  optical  filter  with  a  spatially 
varying  optical  transmittance,  Kfx),  before  it  is  processed  by  a 
photodetector.  The  photodetector  produces  an  output 
proportional  to  the  spacial  integral  at  the  intensity  of  the 
incident  optical  signal. 

In  a  distributed-effect  sensor,  the  output  of  the  sensor  is 
a  spatially  filtered  function  of  y(x,t),  but  the  function  y(x,t)  is 
not  directly  available.  An  example  of  this  type  of  sensor  is  a 
modal-domain  optical  fiber  sensor.  An  optical  fiber  attached  to 
a  beam  is  sensitive  to  strain  along  its  length.  If  the  sensitivity 
of  the  fiber  to  strain  can  be  varied  as  a  function  of  position 
along  the  fiber,  then,  ignoring  some  constants,  the  output  of  a 
photodetector  at  the  &r  end  <n  the  fiber  can  be  assumed  to  be 

t(t)^\^K(x)^(x,t)ix.  (11) 

The  net  effe-  \t  of  spatially  filtering  and  integrating  the 

output  of  a  '  -cd  sensor,  but  a  distributed  output  is  never 

directly  availaole. 

3.  Filter  desi^ 

For  simplicity,  we  will  restrict  our  attention,  in  this 
section,  to  distributra  positict.  censors,  but  the  results  may  be 
extendi  to  other  diriributed  sensors,  such  as  those  that 
measure  strain  or  velocity.  If  the  system  output  z(t)  in  (5) 
corresponds  to  the  output  eft)  in  (10),  then 

j^^Kfx)  yfx,t)  Jx=Cv{t).  (12) 

We  need  to  answer  the  following  question:  Given  an  arbitr^ 
output  matrix  C.  does  there  exut  a  filter  weighting  function 
Kfx),  such  that  (12)  holds?  If  the  answer  to  this  question  is 

yes,  then  using  distributed  sensor  output  filtering,  we  can 
realize  any  desired  scalar  system  position  output  or 
measurement. 


Substituting  (2)  into  (10)  and  interchaagiag  the  of 
integration  and  summation,  the  output  may  be'written  as 

d/*)  dr)  (13) 

Because  the  mode  sh^>es  (dXz)}  are  a  basis  fi»  the  set  of  all 
square-intenable  functions  d^nra  on  OfH)  [5],  they  are  also  a 
basis  for  the  set  of  all  square-integrahle  fiUter  transmittance 
functions.  If  we  restrict  Kfx)  to  the  finite-diinensional  mace  of 
square-intenable  functions  spanned  by  the  set  of  N  mode 
shapes  used  to  approximate  y  fx,t)  in  (2),  then  there  exists  a  set 
of  scalars  (hj,  ...  ,  k/f}  such  th^ 

^  *i  di(*)-  (14) 

Substituting  the  modal  representation  (14)  for  Kfx)  into 
the  expression  (13)  for  the  output  irields 

*.<*)  H*)  <'»>;<*)•  (15) 

Interchanging  the  order  of  summation  and  integration  in  (15), 
and  recalling  that  the  mode  shapes  are  orthonormal  yields 

i  kj  q  <t).  (16) 

Equation  (16)  implies  that  for  xft)=eft),  as  in  (12),  the  output 
matrix  C  is  given  by 

C=(hi  *2-  (17) 

Conversely,  for  an  arbitrary  output  matrix  C  with  elements  Cj, 


Kr*;=f:cjd/*),  (18) 

}=i 

then  the  filtered  output  of  the  distributed  sensor,  eft),  can  be 
written  in  the  form  (5),  and  equation  (12)  is  satisfied. 

4.  Examples 

4.1  Point  w-nsinf 

One  application  of  distributed  sensors  is  to  duplicate  the 
output  of  a  point  sensor  located  somewhere  on  the  structure. 
The  output  matrix  C  for  xft)=yfxff,t),  was  given  in  (6).  By  (18), 
the  corresponding  weighting  function  is 

Kfx)=.f^,^  {xo)  d;<*).  (19) 

>=l 

Setting  the  scalar  output  xft)  equal  to  the  output  eft)  in  (11), 
yields 


yf^,t)-\^JKfx)yfx,t)  Jx, 

(20) 

Kfx)^(x-Xo), 

(21) 

where  f(z— s^)  is  the  Dirac  delta  function  centered  at 

This  example  shows  that  the  set  of  filter  weighting 
functions  that  produce  scalar  outputs  corresponding  to  the 
outputs  of  point  sensors  is  e^al  to  the  ^  of  delta  functions 
centered  at  points  ^eD(^).  Clearly,  this  set  is  only  a  subset  of 
the  set  of  u  admissable  filter  wonting  functions;  therefore, 
the  set  of  position  output  matrices  realizable  using  pc^t 
sensors,  is  a  subset  of  the  set  at  output  matrices  tealisame  using 
distributed  sensor  filtering. 


4.2  MsdilfiltfiDBS 

Another  application  of  <^tribute<l  sensors  is  modal 
filtering.  The  filter  wei^ting  function  K(x)  may  be  chosen  to 
eliminskte  certain  modu  components,  or  mod<»,  from  the 
system  output.  This  is  analogous  to  the  fiuniliar  notion  of 
band-limited  ffltering  in  conventional  signal  processing.  The 
concept  of  distributM  modal  filtering  for  flexible  structures  has 
been  demonstrated  using  distribute-  effect  sensors  made  of 
piezoelectric  film  r4],  dm  was  used  to  explain  the  robustness 
properties  of  an  LQG  control  system  for  a  flexible  beam  using 
the  holographic  sensor  [3]. 

The  usefulness  of  distributed  sensors  for  modal  filtering 
is  a  result  of  the  orthogonality  of  the  mode  shapes  that  are  the 
basis  for  the  set  of  filter  wei^ting  functions  and  the  structure 
shape  functions.  Choosing  a  set  of  modes,  such  as  the  set 
Bs={6i  (x)\0<tn<M\,  and  selecting  a  filter  weighting 

functiW 

K(x)=£  d,-  (x).  (22) 

m=0 

composed  only  of  modes  contained  in  B,  the  filtered  distributed 
sensor  output  is 

fcm  (23) 

im=0 

Because  the  mode  shap«  are  mutually  orthogonal,  any  modes 
contained  in  the  distributed  measurement  y(x,t)  but  not 
contained  in  the  set  B  are  removed  by  the  filter  and  do  not 
appear  in  the  filtered  output.  Modal  filtering  is  particularly 
useful  for  reducing  observation  spillover  when  the  control 

system  is  designed  using  a  reduced  order  model  of  the  system 
since  it  permits  removal  of  any  unmodelled  system  dynamics 
from  the  measurement  signal. 

4.3  Diatribated  (ymipensator  Gmus 

To  control  a  system  described  by  the  second-order 
differential  equation  (3)  using  a  single  actuator,  a  standard 
approach  is  to  rewrite  the  equation  in  first  order  form 

i  =  A  X  +  B  u  (24) 

z  =  Cx 

using  the  state  x=[q‘^  A  state  feedback  gain  is  computed 

so  that  the  control  u  =  -kx  provides  the  desired  performance. 
Then,  since  the  entire  state  is  not  available  for  feedback,  a 
state  estimator  is  constructed  to  provide  an  estimate  of  x, 
denoted  i,  and  the  feedback  «  =  -k2  is  used.  A  full-order  state 
estimator  would  be  of  dimension  2N  and  have  the  form 

'£=  (A- LC)  z+ Bu+  L  z.  (25) 

If  X  is  of  dimension  m,  then  a  reduced-order  observer  of 
dimension  2N-m  can  be  used  to  provide  an  estimate  of  x.  To 
reduce  the  compensator  dimension  still  further,  consider  the 
case  where  s  s  It  is  easy  to  see  that  the  observability  index 
for  such  a  system  will  be  2.  Thus,  a  functional  obs^er  of 
dimension  1  can  be  constructed  [6].  A  functional  observer  has 
the  form 

h=aFr-i-Gz+Hu  (26) 

w*  Mr+  P  X 

and  the  property  that  w  approaches  -kx  exponentially,  with 
time  constants  determined  by  the  eigenvalues  of  F. 

To  implement  the  functional  observer,  two 
measurements  of  the  form  Cki  and  A7  are  needed,  where  G  and 
P  are  1  X  N  vectors  computed  from  ^e  system  parameters  and 
the  state  feedback  gain.  The  previously  developed  theory  can 
be  used  to  show  h^  such  measurements  can  be  made  on  an 
Euler-Bemouli  beam  using  two  optical  fiber  sensors. 


For  an  Euler-Bemouh  beam  model,  the  mode 
<t>,\x),  are  not  only  orthononnal,  but  satisfy  the 
orthogonality  relation 

d/'Cx;  dx  =  B*jSij.  (27) 

Thus,  the  previous  theory  can  be  used  to  see  that  choosing 

(28) 

Pj 

where  is  the  fth  element  of  G,  makes  the  output  of  the  fiber 
sensor  equal  to  Gif,  as  desired.  By  using  two  weighted  fiber 
sensors  a  first-order  compensator  can  be  developed  to 
implement  the  desired  control  law. 

To  illustrate  these  results,  a  four-mode  model  of  an 
Euler-Bemouli  beam  was  considered  The  tip  displacement  and 
strain  energy  for  an  inital  condition  response  of  the  open-loop 
system  are  shown  in  Figure  1.  The  response  for  a  do^-Ioop 
system  using  a  full-order  compensator,  where  the  measurerr-*"*''^ 
Ltf  were  obtained  using  fiber  sensors  in  a  manner  anal 
that  described  above  tor  the  functional  observer,  is  si. 

Figure  2.  Figure  3  shows  the  response  for  a  closed-loop  - 
using  a  functional  observer.  The  response  is  almost  identic^., 
that  of  a  full-order  compensator,  even  though  the  compei^toi 
order  has  been  reduced  from  eight  to  one.  The  weighting 
functions  for  the  fibers  used  with  the  fimctional  observer  are 
shown  in  Figure  4. 

One  problem  usually  encountered  in  using  functional 
observers  is  that  the  direct  feedthrough  of  the  output  results  in 
excessive  noise  in  the  estimates.  Fib«  sensors,  however,  have  a 
very  high  signal  to  noise  ratio,  and  this  should  not  be  a 
problem.  Also,  aa  discussed  in  Sdtion  4.3,  spillover  noise  will 
not  be  a  problem.  The  im^or  tradeoff  between  a  full-order 
compensator  implemented  using  point  measurements  and  a 
first-order  compensator  implemented  using  weighted  fiber 
sensors  is  between  the  complexity  of  the  compensator  and  the 
complexity  of  implementing  the  distributed  gain  in  the  fiber. 

5.  <TnnrliMinM 

In  this  paper,  we  have  shown  that  by  filtering  the 
output  of  a  distribute  sensor,  we  can  produce  a  scalar  output 
corresponding  to  any  desired  linear  combination  of  the  system 
position  states.  This  result  illustrates  one  of  the  advantages  of 
distributed  and  distributed-effect  sensors  over  point  sensors  - 
the  set  of  scalar  outputs  realizable  using  point  sensors  is  v 
subset  of  the  set  of  outputs  realizable  using  distributed  sensi:. 
and  filtering. 

We  also  described  three  applications  of  distributed 
sensor  filtering.  Other  applications  that  are  currently  under 
investigation  include  the  use  of  distributed  filtering  for  the 
placement  of  transfer  function  zeros,  and  the  optimization  of 
measures  of  observability. 
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Figure  l.a  ResMnse  of  the  open- loop  system  to  initial 
conmtions. 
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Figure  l.b  Response  of  the  open-loop  system  to  initial 
conditions. 
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Figure  2.a  Closed-loop  system  response  using  full-order 
observer. 
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Figure  2.b  Closed-loop  system  response  using  full-order 
observer. 


Figure  3.a  Closed-loop  system  response  using  functional 
observer. 
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Figure  3.b  Cloaed-loop  system  response  using  functions! 
observer. 


Figure  4.S  Distributed  filter  weighting  functions  used  to 
implement  the  functional  observer. 


Figure  4.b  Distributed  filter  wei^ting  functions  used  to 
implement  the  functional  observer. 
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Abstract 

In  this  paper  the  method  of  using  Fuzzy  Logic  is  explained  and  an  application  of  Fuzzy 
Logic  Control  (FLO  to  actively  damp  the  vibrations  of  a  cantilever  beam  is  investigated.  The 
results  of  this  method  are  compared  to  controlling  the  structure  using  optimal  control.  The 
primary  advantage  of  FLC  is  that  satisfactory  control  can  be  achieved  without  detailed  knowledge 
of  the  plant.  A  by-product  of  the  method  is  that  the  FLC  is  able  to  utilize  the  maximum  control 
effort  available  at  the  actuator. 

Introduction 

Since  the  early  1970's  it  has  been  proposed  that  the  theory  of  "Fuzzy  Logic"  could  be 
used  to  design  control  systems  for  poorly  modeled  and/or  complex  systems  [1].  To  date  many 
successful  applications  have  been  documented  (2,3,41.  The  principal  advantage  of  a  Fuzzy 
Controller  is  its  performance  and  simplicity  of  design  in  the  absence  of  accurate  plant  models. 
The  typical  Fuzzy  Controller  is  a  combination  of  Fuzzy  Logic  and  rule-based  expert  systems.  The 
"rules"  express  the  control  policy  much  as  people  do.  The  fact  that  rule  conditions  are  stated  in 
"human"  (qualitative)  terms,  such  as  "the  error  is  moderately  large",  rather  than  in  "computer" 
(quantitative)  terms  which  require  numerical  ranges,  converts  the  use  of  very  few  rules  into  an 
effective  controller.  Lee  [5]  provides  an  excellent  tutorial  paper  with  a  comprehensive  collection 
of  references. 

The  past  few  years  have  witnessed  an  exponential  growth  in  the  applications  of  Fuzzy 
Control;  however,  the  major  interest  has  been  in  Japan  and  Europe  and  not  in  the  U.S.  where  the 
concept  originated.  Outside  of  academic  investigations  the  use  of  Fuzzy  Control  seems  to  have 
been  relegated  to  household  appliances,  printers,  and  other  "simple"  controllers.  One  exception 
is  that  of  the  controllers  used  on  some  of  the  Japanese  trains  [61.  But  in  all  cases  the  applications 
appear  to  be  slight  improvements  in  controllers  which  could  have  been  designed  by  classical 
techniques. 

A  Fuzzy  Logic  Controller  (FLC),  see  Fig.  1,  can  be  viewed  as  a  way  of  converting  expert 
knowledge  into  an  automatic  control  strategy  without  a  detailed  knowledge  of  the  plant.  The 
input  to  the  FLC  is  fuzzified,  or  in  other  words,  converted  to  the  Fuzzy 

Set  Domain.  Then  it  is  processed  by  the  Fuzzy  inference  Engine  and  a  Fuzzy  output  is  obtained 
(in  the  form  of  a  sum  of  weighted  Fuzzy  Sets).  That  in  turn  is  defuzzified,  or  converted  back  into 
a  number  in  the  real  Domain  which  is  used  as  the  control  input  to  the  process  to  be  controlled. 
To  provide  a  benchmark  for  comparison,  the  example  system  will  be  controlled  using  an  observer- 
based  controller.  With  observer-based  control  (Fig.  2)  the  closed  loop  system  consists  of  an 
observer  to  reproduce  the  states  from  the  output  and  state  variable  feedback  with  gains  derived, 
e.g.  by  optimal  control,  to  provide  the  feedback  correction. 
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Both  of  these  ideas  wil!  be  applied  to  actively  damp  the  vibration  of  a  cantilever  beam. 
A  system  will  be  assumed  for  simulation  purposes  and  both  control  strategies  will  be  compared. 

Fuzzy  set  theory 

Fuzzy  set  theory  was  developed  in  an  effort  to  deal  with  the  uncertainty  and 
impreciseness  that  is  abundant  in  the  real  physical  world.  It  was  expected  to  model  human 
reasoning,  the  ability  of  the  human  mind  to  deal  with  vague  terms,  and  the  ability  to  make 
decisions  based  on  imprecise  data. 

The  definitions  on  Fuzzy  Set  Theory  are  as  they  appeared  on  Zadeh'sIB]  seminal  paper: 
A  Fuzzy  Set  is  a  class  of  objects  with  a  continuum  of  grades  of  membership,  it  is 
characterized  by  a  membership  (or  characteristic)  ftmction  which  assigns  to  each  object,  a 
grade  of  membership  ranging  between  0  and  1 .  More  formally: 

Let  X  be  a  space  of  points  and  x  in  X  (X  is  also  called  the  Universe  of  discourse). 

Then  a  Fuzzy  set  A  €  X  is  characterized  by  a  membership  function  which  associates  with 
each  point  in  X  a  real  number  in  the  interval  (0,1)  with  the  value  of  ^a(x)  at  x  representing  the 
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"grade  of  membership”  of  x  in  A. 

An  ordirtary  set  then  is  a  set  where 
fjfS*)  only  takes  the  values  of  0  and  1  on  X. 

An  ordinary  set  could  be  viewed  as  a 
special  case  of  a  Fuzzy  Set,  or  a  Fuzzy  Set 
could  be  viewed  as  an  extension  of  an 
ordinary  set. 

Two  fuzzy  sets  are  equal  if  they  have 
identical  membership  functions  for  all  objects 
of  X. 


A  -  B  p^(x)  -  p^),  V  X  e  X 


The  complement  of  a  fuzzy  set  A  is 
denoted  by  A'  and  is  defined  by 

PxW  -1-P»  .  VxeX 


A  fuzzy  set  A  is  contained  in  B  (or  is  a  subset  of  B,  or  is  smaller  than  or  equal  to  B)  if  and 
only  if  the  membership  function  of  A  is  smaller  than  or  equal  to  the  membership  function  of  B  for 
all  objects  of  X. 


A  cB  p^(jt)  i  p^),  V X  e  X 

Union:  The  union  of  two  fuzzy  sets  A  and  B  with  membership  functions  fijx)  and 
respectively  is  a  fuzzy  set  C  written  as  C  ^  A  U  B  vrhere  the  membership  function  of  C  at  x  is 
the  maximum  of  the  2  membership  functions. 

Pc(x)  -  >naxlp4(i^.p^)].  V  X  €  X 

or  in  abbreviated  form 


Pc(*)  -  PxW  V  p,(*) 


(The  union  of  A  and  B  is  the  smallest  fuzzy  set  containing  both  A  and  B) 

Intersection:  The  intersection  of  two  fuzzy  sets  A  and  B  with  respective  membership 
functions  ;ia(x)  and  //s(x)  is  a  fuzzy  set  C  written  as  C  =  A  fl  B  whose  membership  function  is 
related  to  those  of  A  and  B  by 


Pc(*)  “  fnlnCp^W.PjC*)].  V  X  6  X 

or  in  abbreviated  form 


Pc(*)  -  PxW  A  p*(x) 

(The  intersection  of  A  and  B  is  the  largest  fuzzy  set  which  is  contained  in  Doth  A  end  B). 

The  Algebraic  product  of  two  fuzzy  sets  A  and  B  is  defined  by  their  membership  functions 
as  UmM  =ft^{x)ut{x).  (Note  ABS(AnB))  and  the  Algebraic  Sum  is  defined  as 
(Note  that  ((AUB)S(A  +  B)). 

A  normal  fuzzy  set  is  a  set  that  attains  the  maximum  of  1 . 
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A  convex  fuzzy  set  is  a  set  such  that  (see  Fig.  3) 

i  x^.  *2  e  X.  X  efO.ll 

A  fuzzy  number  is  defined  as  the  fuzzy  set  which  is  both  normal  and  convex  (Note  that 
the  set  in  Fig.  3  is  a  fuzzy  number). 

A  fuzzy  partition  on  the  universe  of  discourse  is  the  association  of  each  linguistic  variable 
with  a  term  set  on  the  universe  of  discourse  (PB,PM,...).  So  a  linguistic  variable  takes  on  values 
of  different  fuzzy  sets  defined  on  a  certain  universe  of  discourse.  An  example  of  a  fuzzy  partition 
of  the  input  space  (of  the  fuzzy  controller)  is  given  in  Appendix  A  Rgure  1 .  You  can  note  that  the 
primary  set  of  this  partition  is  a  triangular  set. 

The  last  definition  we  are  interested  in  at  this  time  is  the  concept  of  a  fuzzy  relation  or 
fuzzy  implication  which  is  a  generalization  of  the  concept  of  a  function.  An  n-ary  fuzzy  relation 
in  X  is  a  fuzzy  set  in  the  product  space  X  x  X...x  X.  For  such  relations  the  membership  function 
is  of  the  form  where  Xj€X,  i  =  1,2,...,n.  This  definition  is  the  one  that  permits  the 

use  of  fuzzy  logic  control. 

Fuzzy  Logic  Controller 

Control  was  one  of  the  first  areas  where  Fuzzy  Set  Theory  was  applied  and  it  has  enjoyed 
a  great  deal  of  success  in  applications  where  the  systems  to  be  controlled  were  ill-defined,  or 
where  the  process  dynamics  were  more  or  less  defined  by  the  experience  of  an  expert  operator. 
Most  of  these  systems  have  a  very  slow  response  but  with  the  emergence  of  Fuzzy  Logic  chips 
the  ability  to  control  faster  responding  systems  was  realized. 

The  FLC  provides  an  algorithm  which  can  convert  the  linguistic  control  strategy  based  on 
expert  knowledge  into  an  automatic  control  strategy.  The  essential  part  of  the  FLC  is  a  set  of 
linguistic  control  rules  related  by  the  dual  concepts  of  fuzzy  implication  and  the  compositional 
rules  of  inference  (Lee!  51). 

The  FLC  consists  of  3  components  (see  Fig.  1): 

A.  The  Fuzzification  interface 

B.  The  knowledge  base  and  the  decision  making  logic 

C.  The  defuzzification  interface 

A  designer  has  to  take  into  account  many  design  variables  that  have  to  do  with  how  each 
linguistic  variable  is  defined  and  how  you  go  from  the  Real  domain  to  the  Fuzzy  Set  domain  and 
back. 

A.  The  fuzzification  interface 

1 )  measures  the  values  of  the  input  variables  (output  and  input  process  variables). 

2)  performs  a  scale  mapping  that  transfers  the  range  of  values  of  input  variables 
into  corresponding  universes  of  discourse. 

3)  converts  the  input  data  into  suitable  linguistic  values  which  may  be  viewed  as 
labels  of  fuzzy  sets  (or  linguistic  hedges,  basically  the  labels  PS,  NM  etc). 

B.  The  knowledge  base  comprises  a  knowledge  of  the  application  domain  and  the 
attendant  control  goals. 

1 )  data  base  provides  necessary  definitions 

2)  rule  base  characterizes  the  control  goals  and  the  control  policy  of  the  domain 
experts  by  means  of  a  set  of  linguistic  control  rules. 

The  decision  making  logic  is  the  heart  of  an  R.C;  it  has  the  capability  of  simulating  human 
decision  making  and  of  inferring  fuzzy  control  actions,  employing  fuzzy  implication  and  the  rules 
of  inference  in  the  fuzzy  logic. 

C.  The  defuzzification  interface  performs 

1 )  scale  mapping  which  converts  the  output  variables  into  corresponding  universes 
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of  discourse. 

2)  defuzzification  which  yields  a  nonfuzzy  control  action  from  an  inferred  fuzzy 

control  rule. 

The  fuzzy  control  rules,  also  known  as  linguistic  description  rules  have  the  format: 

”if  (a  set  of  conditions  are  satisfied)  then  (a  set  of  consequences  can  be  inferred)” 
and  they  are  implemented  by  a  fuzzy  implication  (a  fuzzy  relation). 

Very  important  to  the  database  is  the  normalization  /discretization  of  the  universes  of 
discourse,  the  fuzzy  partition  of  the  input  and  the  output  spaces,  the  completeness  of  the  data 
base  and  the  choice  of  the  function  for  the  primary  fuzzy  set.  For  the  rule  base  it  is  the  choice 
of  the  process  variables  the  source  and  derivation  of  the  fuzzy  control  rules,  the  types  of  the  fuzzy 
control  rules,  their  consistency,  interactivity  and  completeness.  These  are  the  guidelines  a 
designer  has  to  follow  and  take  into  account  in  order  for  the  FLC  to  be  effective. 

A  popular  way  for  deriving  a  real  output  from  the  system  (defuzzification)  is  the  method 
of  the  centroid.  Suppose  that  2  or  more  rules  apply  in  a  particular  case  the  jth  applied  rule  being 
(Appendix  A  Fig.  2): 

i^xl  is  /fy,  is  ...  .  then  y  is 

then  the  weight  Wj  is  given  by 

wj  -  /,/*,)  A  7^(4)  A  ...  A  /^J 

or  for  simplicity 

*  -  *  V**) 

Then  respectively  B*  (the  Fuzzy  output  set)  for  both  cases  is 

5*  -  U  *1/  and  -  E  • 

J  i 

The  real  output  y  can  then  be  inferred  by  taking 

/g*0>)  y  dy 

^  fs'iy)  dy 

The  first  case  is  what  is  known  as  the  max-min  convolution,  and  the  second  is  the  sum-product 
convolution. 

System 


For  simulation  purposes  the  equations  used  were  of  a  cantilever  beam  where  both  the 
actuator  and  the  sensor  were  located  at  the  tip  of  the  beam.  The  first  3  modes  were  used.  The 
system  was  then  discretized  and  the  discrete  system  was  simulated.  The  sampling  frequency  was 
5  KHz. 

From  VanLandingham  [1 1]  the  equations  of  die  continuous  system  were: 

it  -A  X  *  Bu 
y  -C  X 

In  the  next  section  we  oresent  some  comparitive  results  using  FLC  and  a  standard  state  feedback 
controller  to  damp  the  first  mode  of  a  three-mode  system. 
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Simulation  raauita 

The  input  to  the  FLC  is  the  output  of  the  process  and  the  change  in  the  output  (since 
setpoint  =  0),  that  is: 

error  =  y(k) 

change  in  error  =  y(k)-y(k-1 ) 

The  output  of  the  FLC  is  the  control 
effort  to  be  applied  to  the  process  and  the 
scaling  factor  to  the  output  is  the  maximum 
control  effort  that  the  actuator  can  supply. 

The  scaling  factors  on  the  inputs  are:  the 
position  of  the  peak  of  the  PB  term,  the 
term  set  of  the  Fuzzy  partition,  and  the 
input  Universe  of  discourse  (Appendix  A. 

Fig  1). 

The  rule  base  used  is  as  appears  in 
Fig.  4  and  is  of  the  form:  if  y  is  "rowj”  and 
ce  is  "columnj"  then  output  is 
“elementji". 

The  defuzzification  strategy  used  is 
the  centroid  method  described  above  using 
sum-product  convolution. 

The  results  for  the  simulation  of  the 
controlled  system  using  different  scaling 
factors  on  the  input  and  output  spaces 
appear  in  the  final  four  figures.  The  3 
numbers  appearing  at  the  bottom  are  the 
scaling  factors  on  the  error(y),  the  change 
of  error,  and  the  output(u|.  As  the  output  scaling  factor  is  increased  (the  control  effort  applied), 
the  system  responds  faster,  however  if  it  is  increased  too  much  the  response  does  not  settle  to 
the  set-point.  If  the  input  scaling  factors  (especially  error)  are  decreased,  finer  control  is  obtained 
and  the  response  is  faster.  The  two  scaling  factors  on  the  output  and  the  input,  could  be  tuned 
and  then  be  reduced  as  the  set-point  is  approached,  thus  continuously  driving  the  system  to  the 
set  point.  These  scaling  factors  could  also  be  saved  in  the  form  of  look-up  tables,  increasing  the 
number  of  rules  of  the  system  but  also  increasing  the  adaptability  of  the  controller. 

For  the  optimal  control  simulation  the  assumption  was  that  there  was  only  information  on 
the  dynamics  of  the  first  mode  (since  FLC  does  not  use  any  knowledge  of  the  system  dynamics). 
The  response  was  never  driven  to  the  set  point,  but  only  oscillating  around  it  Fig.  8).  Also  with 
time  the  oscillations  grew  due  to  the  "spillover"  effects. 

Conclusion 

The  design  of  Fuzzy  Logic  Controllers  was  reviewed.  The  FLC  is  an  inherent  nonlinear 
control  method  which  implements  a  set  of  linguistic  (i.e.  understandable  to  a  human)  rules.  It 
typically  performs  much  better  in  the  absence  of  exact  knowledge  of  the  system  dynamics  and, 
additionally,  can  be  tuned  to  give  an  optimal  performance  for  a  particular  system.  A  comparison 
was  made  between  the  FLC  and  a  standard  optimal  state  feedback  controller.  The  FLC  is  shown 
to  have  a  clear  advantage  over  the  Optimal  Controller  because  information  on  the  system 
dynamics  was  restricted  to  one  mode  in  the  three-mode  system. 
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FIGURE  6.  Fuzzy  Control 
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FIGURE?.  F^issy Control 


HGURE  8. 

Optimal  Control  Response 
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Abstract 

This  paper  presents  a  ample  computational  algorithm  for 
determining  a  state  space  model  of  a  MIMO  Discrete  System  from 
Input-Output  Data  .  The  obtained  state  space  model  is  in  a 
Pseudo-Observable  Canonical  Form.  Unlike,  all  other  deterministic 
identification  procedures,  the  approach  suggested  in  this  paper,  does 
not  require  structural  identification  [5]  ,[6]  ,  i.e,  determination  of  a 
unique  set  of  observability  indices.  Instead,  as  it  has  been  presented 
elsewhere  [1]  ,[2],  a  MIMO  system  could  be  represented  in  pseudo- 
observable  canonical  form,  baaed  on  a  set  of  admissible  pseudo¬ 
observability  indices  [3]. 

Introduction 

Consider,  a  linear  time-invariant  discrete  system.  It  is  known,  [1]- 
[3],  that  based  on  a  selected  set  of  admissible  pseudo  observability 
indices^ 

n  =  I  n^,  n^,  .  .  i  n^  >;  p  being  the  number  of  inputs,  any 

n**  order  MIMO  discrete  system  could  be  represented  by  the 
following  pseudo-observable  canonical  form  : 

*  (  HI)  =  Ao  *  (  t )  -f  Bo  »  (  *  )  (  1  ) 

,  (  *  )  =  Co  r  (  t )  -f.  Bo  «  (  *  )  ;  (  2  ) 

*(0)=  xo  (3) 

where  x  r  R",  u  r  R™,  y  <  R**,  are  the  state,  input  and  output 
vectors  respectively  ,  while  Ao,  Bg,  Cg  and  Dg  are  matrices  of 
compatible  dimensions.  In  [3],  it  has  also  been  shown  that  the  total 
number  of  sets  of  admissible  pseudo  observability  indices  is  less 
than  .  1  \  f 

(-FllVn^p)!  • 

The  pair  (  Ao,  Cg)  in  the  pseudo  observable  form  is  characterised 
by  the  following  structure. 


Co=  0  1  0  ... 

0  0  10 


From  (  $  )  &  (  6  ),  it  can  be  concluded  that  Ao  has  only  p  rows 
with  non-zero  and  non-unity  elements.  Location  of  these  rows 
-  ~  I  *1’  ‘  ‘  uniquely  determined  by  the  set 

of  assumed  pseudo  observability  indices  n|'.  The  remmning  (  n  -  p  ) 
rows  of  Ao  correspond  to  the  last  (  a  -  p  )  rows  of  the  Identity 
matrix  /g.  The  first  p  rows  of  this  Identity  matrix  correspond  to 
the  rows  in  Cg.  Matrices  Bg  and  Dg  do  not  have  any  specific 
structure. 


*,I  • 

^nm 

dll  • 

‘'im 

v  - 

^pm 

“2n 

0  0  I 


Identification  Identity 

It  is  assumed,  that  the  Input-Output  sequences  : 
{»(*).»(*)};  t=  0,1,2 . JV-1  ; 

“  (  *)  =  «*  :  y  (  *)  =  s* 

corresponding  to  an  n^^  order  system  are  available.  The  approetch 
suggested  in  this  paper,  determines  a  system  representation 
satisfying  (1)  -  (3),  where  Ao,  Bg,  Cg  and  Dg  are  given  by  (  5  )  - 
(  8  ).  In  order  to  determine  non-zero  and  non-unity  parameters  in 
Aq,  Bg,  Cg,  Dg  and  Zg  the  following  procedure  is  suggested. 

From  (  I  )  -  (  3  ),  the  following  equation  is  obtained  : 

Y(t)  =  Qox(t)  -b  HU(k)  (9) 

where  in  (  9  )  the  matrices  Y(  k  )  and  U(  i  )  are  (l-fl)p  and  (l-l-l)p 
column  vectors,  while  Qo  and  H  are  matrices  of  the  dimensions 
(  1-b  l)px  n  ,  n  X  1  and  (  I  -I-  1  )px  (  I  -f  1  )  respectively  .  Qo  i* 
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llie  observability  matrix  of  the  pair  (  Ao  ,  Co  )  <uid  I  —  n-p+l. 
The  structure  of  these  matrices  is  as  follows  : 


U(t)  = 


y(t)  = 


^oAqBq  CqBo 


where  contains  the  first  m  (  n'  +  1  )  rows  from  U^,  wAere 

n'  =  mox  I  n  ^  J  ,  for  »  =  1,  ..  ..  p  and  tf  =  -  A'  . 

The  matrix  ^  j  ^ 

”  parameter  matrix  ”,  since  it  is  obvious  that  it  depends  only  on 
the  elements  of  Aq  >  Bot  Co  end  Dq  .  Using  a  set  of  y  aviulable 
measurements,  where  (  satisfies  the  condition  , 
n  +  (  n*  +1  )  m  <  j  ,  we  define  : 


-  [  ^24  ^'2(4+ 1) 


'2(4+,.l) 


|_'14  J 

The  dimensions  of  Y  and  Z  are  p  x  4  and  (  (  ti'+  1  )  m  +11 )  x  4, 
respectively.  Using  the  Least  Squares  Method  [  7  ],  from  (  15  )  - 
(  17  )  ,  the  parameter  matrix  can  be  expressed  as  : 


a'  J  =  YZ"^  {^Zz'^y^  (  18  ) 


where,  in  the  case  of  a  sufficiently  rich  input  signal  u(k)  and 
admissible  set  of  pseudo  observability  indices,  Z  is  a  full  row  rank 
matrix.  Since,  we  have  A',  the  matrix  Ag  is  determined  directly 
from  (  18  )  and  (  5  )  ,  while  Co  is  known  to  have  the  structure  as 
ill  (  6  ). 


Elements  of  vectors  *1.  end  are  related  to  the  elements 
in  Ao  ,  Bo  ,  Co  and  Do  by  the  set  oT  (  f  A-  1  )  p  scalar  equations 
given  by  (  9  ).  It  could  be  verified  that  among  (  /  +1  )p  rows  of 
the  observability  matrix  Qg,  there  are  n  rows  equal  to  n  rows  of  /„ 
and  p  rows  corresponding  to  the  non-zero,  non-unity  rows  from  Ao- 
The  locations  of  the  n  rows  corresponding  to  fn  and  the  p  rows 
corresponding  to  A  9  are  uniquiey  determined  by  the  assumed  set  of 
admissible  pseudo-observability  indices. 

Now  we  select  from  the  vector  y(  4  )  subvectors  yj^  and  Y^f^ 
corresponding  to  the  n  rows  in  Qo  containing  rows  equal  to  /n  and 
the  p  rows  in  Qo  containing  non-zero,  non-unity  rows  from  Ao- 
Let  the  corresponding  rows  from  H  be  //j  and  H2-  Then,  from 
equation  (  9  )  the  following  equations  can  be  formed. 

yi4  =  /  +  ffi  1^*  ( 12 ) 

+  («) 


It  can  be  easily  shown  that  Bo  —  Qe^  B* 

where  ^  5*  i  AgBe  I  ...  Ao^' Be  J 

The  columns  4^y  of  (  n  x  p  )  ”  equivalent  input  matrix  ”,  B^, 

contain  (  n  -1  )  zeros  and  only  one  unity  ,  whose  location  is 
determined  by  the  integers  of  the  set  S^,  where  Ag  has  non-zero, 
non-unity  rows  .  B*  is  formed  by  the  following  partitioning  of  . 
B'=[fl'o  •  B'l  i  ...  ;  J  (19) 


B*=  : 


A'=  ... 


“pn 


Therefore,  eliminating  the  state  vector  from  (  12  )  and  (  13  )  the 
following  Identification  Identity  is  obtained; 


IK 

A' J  ...  (15; 


Note  that  the  dimensions  of  Bf,  B*  and  Bf  ^  ,  t  =  0 . ,11', 

arc  p  X  m(nVl)  ,  p(  »*-H  )  x  m  and  p  x  m  ,  respectively. 

Determination  of  Dj^ 

From  the  definition  of  transfer  function  matrix  we  have  : 

G(z)  =  r^z)  N{x)  =  Co(  h-AoYho  +  Do  (21) 

or 

Do  =  F^t)  D(z)  -  Co  (D  -Ao  )'*Bo  (  22  ) 


where  Af(z)  and  I\z)  are  (  p  x  m  )  and  (  p  x  p)  co-prime 
polynomial  matrices  [4]  ,  where 


N{z)  and  F(z)  =s  ^  F^z*  .  Since  Do  on  the 

i=0  i=0 

I.I1.S  of  (  22  )  does  not  depend  on  the  Z-  tranafocm  variable  s,  it 
could  be  calculated  by  evaluating  the  r.h^  of  (  22  )  for  an  arbitrary 
value  of  z.  Since  a  discrete  time  system  has  no  eigenvalue  at  r  =  0 
,  it  follows  that  Fo  and  Ao  are  always  non-singular  .  From  ( 4  ]  it 
can  be  concluded  that  No^^  fi'n*  (  19  )  and  Fp  is  given  by  the . 
first  p  columns  of  A\  leading  finally  to  : 

Dp  =  No  +  CoAo'^Bo  (  23  ) 

Determination  pf  Initial  Condition  z ^ 

The  initial  condition  vector  Xp  ,  corresponding  to(l)-(3), 
could  be  calculated  directly  from  (  12  )  ,  by  setting  i  =  0  i.e 

=  ^lo  -  (  24  ) 

Selector  Vector  Algorithm 

1.  Define  a  Set  n  =  |  n.,  .  .  ,  np|,  of  admissible  pseudo 
observability  indices  where  p  is  tlie  number  of  outputs  of  the 
system,  p 

2.  Set  n  =  ^  n.  ;  where  n  is  the  order  of  the  system  , 

i=l 


n'  =  max  {  n-  |  and  np  =  (n*-)-l)  *  p  . 

3.  Set  i  =  1.  •' 

4.  Set  h  =  i  . 

5.  For  j  =  1  through  np  ,  Set  Vx(  h)  =  n.  -Hl-j,  h  =  h-i-p. 

6.  Set  i  =  i  +  1  . 

7.  If  i  >  p  ,  go  to  6  :  else  ,  go  to  4. 

8.  Set  i^  =  0  ,  ij  =  0  ,  k  s  1. 

9-  If  (  k  )  <  0  ,  go  to  11;  else  ,  go  to  10  . 

10.  Set  i.  =  i.  -b  1,  j}(  i.  )  =  k. 

11.  If  (  k  )  #  0  ,  go  to  13;  else,  go  to  12. 

12.  Set  io  =  io  +  1,  1(10)  =  k  • 

13.  Set  k  =  k  -b  1. 

14.  If  k  <  np  ,  go  to  9  ;  else,  go  to  IS. 

15.  Set  q  =  p  -b  1,  ia  =  0  ,  ip  =:  0,  ij  =  0. 

16.  Set  ip  Ip  ^  1. 

17.  If  Vx(  q  )  <  0  ,  Set  ip  =  ip  -  1  and  go  to  21;  else,  go  to  18. 

18.  If  Vx(  q  0  ,  Set  ij  =  ij  -b  1;  else,  go  to  20. 

19.  Set  Sc  ( i,  )  =  ip. 

20.  Set  ia  =  la  +  1  ,  $  (  ia  )  =  ip  ,  q  =  q  -b  1. 

21.  Set  q  =  q+I  . 

22.  If  q  <  np  ,  go  to  16;  else.  Stop. 

bamplt 

For  p  =  3  and  0  =  |  1  ,  4 , 2  |  ,  the  above  suggested 
algorithm  gives  the  following  selector  vectors  : 

,  =  {1.5,7}.  h  ={7, 5, ,,5}, 
b  =  |l,2,3,5,6,8,ll}andr  =  ^4,9,14}. 
Identification  Algorithm 

1.  Given  an  Input-Output  sequences  |u(k),y(k)j, 
k  =  0  ,  1  ,  .  .  ,  N-1  ,  corresponding  to 

an  n^^  order  Linear  MIMO  discrete  system, 
u  (  k  )  6  R"*  ;  y  (  k  )  e  R**  (25; 

where  1  <  m  <  n  and  I  <  p  <  n  . 

2.  Assume  a  set  g  =  |  itj  ,  ,  •  .  ,  Up  |  , 


of  admissible  pseudo  observablility  indices  which  satify  the 
P 

condition  n  =  ^  nj  ;  1  <  n|  <  n  -  p  -b  1,  ( 

Let  n'  4  max  |  n^j  ,  n^  ,  .  .  ,  nj,  J  . 

Build  the  (n*-bl)xNj  matrix  U  ,  where 
Ni5"‘(“^+l)i  defined  by  ; 

U(0)  ...  U(Nj-l) 

U(l)  ...  U(Ni) 


I^U(n')  ...U(Ni-bn'-l)J 

If  the  matrix  U  is  of  the  full  row  rank  ,  i.e 
p|  U  I  =  (  n'  -b  1  )  m  ,  (28) 

then  the  input  sequence  is  ”  sufficiently  rich  '  and  is  capable 
of  exciting  all  the  n  modes  of  the  system  to  be  identified. 
Ifp|U|<(n'-bl)m,  then  either : 

(  a  )  Select  another  set  of  pseudo  observability  indices  having 
a  smaller  value  of  n' 

or 

(  b  )  Select  a  different ,  more  rich  input  sequence  which  would 
satisfy  the  richeness  conditon  (  28  ). 

3.  Using  the  selector  vector  algorithm  determine  the  following: 

(a)  A  set  of  integres. 


8f^  —  ^  Sj,  .  .  , 


corresponding  to  the  locations  of  non-zero,  non-unity  rows 

»i  =  I  *11  .  •  •  .  *!„  I  (30) 

of  the  matrix  Ao  of  the  pseudo  observable  canonical  form. 

(b)  A  set  of  (  n  -  p  )  integers  (  complement  to  the 

set  ^  )  corresponding  to  the  locations  of  the  last  (  n  -  p  ) 
rows  from  the  Identity  matrix  In  in  the  matrix  Aq. 

(c)  A  set  of  n  integers  : 


b'={h\,  ..  ,h{,} 


corresponding  to  the  location  of  the  n  rows  of  In  in  the 
observability  matrix  : 


CqAo 
Qo=  • 


(d)  A  set  of  p  integers  ; 

=  {  r*j  ,  .  .  ,  rjj  J  , 

corresponding  to  the  locations  of  the  rows  a.  in  the 
observability  matrix  Qq  .  ’’ 

4.  Build  the  p  (  n^  -b  1  )  X  N.  matrix  Y  ,  defined  by  ; 

V(0)  ...  V(Nj-l) 

Y= 

Y(n')...  V(Nj+n'-l) 


5.  From  the  matrix  Y  select  the  (  n  x  Nj)  matrix  with  n 
rows  corresponding  to  the  elements  in  .  This  selection  could 
be  represented  by  the  following  premultiplication; 

Yj  =  Sfc  X  Y  (35) 

where  S|,  is  the  "  selector  matrix  ”  of  dimension  (  n  x  N|  )  . 
The  row  of  have  (  •  1  )  zeros  and  a  unity  at  the 

location  specifled  by  the  integers  hj^  of  the  set  h^  . 

6.  From  the  matrix  Y  select  the  (  p  x  Nj  )  matrix  Y^  with  p 
rows  corresponding  to  the  elements  in  r^.  This  selection  could 
be  represented  by  the  following  premultiplication  ; 

Yj  =  Sr  X  Y  (36) 

where  Sr  is  the  "  selector  matrix  "of  dimension  (  p  x  )  . 
The  k^^  row  of  Sr  have  (  -  1  )  zeros  and  a  unity  at  the 

location  specified  by  the  integers  of  the  set  ^  . 

*  fu' 

7.  Build  the  matrix  Z  ;  Z=  ...  ;  (37) 

If  the  matrix  Z  is  of  full  tow  rank  ,  then  the  selected  set  of 
pseudo  observability  indices  are  admissible. 

8.  Using  the  Least  Squares  algorithm  determine  the 
p  X  ((  n*  +  1  )  m  +  n]  "  parameter  matrix  "  Q  , 
where  Q  a  (  B'  f  a'  ] ,  satisfying 


Y2  =  QxZ  =  [B'iA'l  ...  . 


9.  B'  and  K'  are  p  x  (  n'  -f  1)  m  and  (  p  x  n  )  matrices, 

respectively  .  The  p  rows  in  the  matrix  M!  correspond  to  the 
non-zero,  non-unity  rows,  (  30  )  ,  of  the  matrix  Aq.  Thus 
having  determined  the  parameter  matrix  Q  ,  i.e  the  matrix 

a',  . 


A'=  i 


The  matrix  Aq  in  the  pseudo  observable  canonical 
form  can  be  easily  determined  using  the  sets  $*snd 

10.  As  it  has  been  mentioned  earlier,  the  matrix  Cq  in  the  pseudo 
observable  canonical  form  is  always  of  the  structure  is, 

Co  =  [  Ip  :  0  ],  i.e  it  contains  the  first  p  rows  from  the 

identity  matrix  Ig.  The  input  matrix  Bo  is  to  be  determined 
by  the  following  procedure. 

11.  The  px(n'-fl)m  matrix  B'  is  partitioned  into  n'  +  1, 


(  p  X  m  )  submatrices  6.  , .... ,  B  ,  i.e  : 

«  n 

=  t  •  •  • :  1  (40) 

12.  Let  the  columns  b^j  of  the  (  n  x  p  )  matrix  B*,  ’  equivalent 
input  matrix  "  ,  contain  (  n  -  1  )  zeros  and  only  one  unity  , 
whose  location  is  determined  by  the  integer  ^  of  the  set  gi  . 

13.  Build  the  [nx(n'4-l)p]  controllability  matrix  of 
the  pair  I  Ao  .  Be  }  : 

Qc  =  [  Be  ;  AgBe  :  ...  •  Ao”  Be  ]  (41) 

14.  The  input  matrix  Bq  of  the  pseudo  observable  canonical  form 

can  now  be  easily  determined  from  Bg  =  Qj  B*  (42) 


where  B*  =  I  ; 


The  ”  direct  path  ”  feed  -  through  matrix  Do  in  the  pseudo 
observable  canonical  form  is  determined  in  the  following  way. 

15.  We  know  that , 

G(z)  =  Co  (  Is  -Ao  )*^  Bo  +  Do  =s  D"I(  s  )  N  (  s  )  for  ail  s, 

1  1 
Do  =  D-J(i)N(i)-Co(h-Ao)  -^Bo. 

Since  the  matrix  Do  does  not  depend  on  z.  Do  can  be 
determined  by  evaluating  the  above  equation  at  s  =  0,  i.e 

Do  =  D-‘(0)N(0)-hCo  Bo.  (45) 

16.  From  the  identification  identity  it  can  be  seen  that 

N  (  0  )  =  B'q  and  D  (  0  )  =  Aj  ,  where  (  p  x  p  )  matrix  A., 
contains  the  first  p  columns  from  A\  i.e  : 

A'  =  IAj:A2]  (46)] 

17.  The  initial  condition  vector  x(o)  =Xo  can  be  determined  by 
the  following  equation  ; 

y  (  o  )  =  Xo  -  u(  o  )  (47) 

where  y  (  o  )  is  the  first  column  of  the  matrix  Y.  and  u  (  q  ) 
is  the  first  column  of  the  matrix  U  while  11,  is  formed  as 
follows  ; 

Hi  =  *  H  (4g) 

where  the  (  n  x  N  j  )  selector  matrix  S|j  is  given  by  (  35  ).  j 
Illustrative  Examnie 

In  order  to  generate  the  input/output  sequence  |  u(k),  y(k)  |1 
which  will  be  used  in  the  indentification  algorithm  the  following] 
example  is  considered.  ' 

X  =  A  X  -t-  B  u 

y  =  Cx-«-Du,x(o)  =  Xo  (49)j 
where  I 


(50) 


A  = 


.1  0  0  0  0 

0  .2  0  0  0 

0  0  .3  0  0 

0  0  0  .4  0 

0  0  0  0  .5 


B=: 


1  1 
.01  1 
.02  1 
0  1 
0  1 


Xo  = 


1.010  0  o' 

0  o' 

c  = 

0  0  1  .01  0 

D  = 

1  0 

11111 

0  0 

i.c  n=5,  m=2  and  p=3.  For  the  input  sequence  u(k),  the  following 
inx  N  ,  N  =  17  ,  matrix  consisting  of  pseudo  random  numbers  is 
selected. 


u(k)  = 


0.448  2.223  1.885  1.900  0.380  2.349  0.827  -0.415  0.654 
1.817  1.792  -0.279  0.382  0.776  -0.391  2.266  -0.25  0.091 

-0.423  0.782  -0.270  -0.064  1.182  1.151  0.907  0.616' 
-0.249  2.489  1.394  1.922  2.371  2.360  1.1 


1.907  0.616  1 
1.876  2.172  S 


The  ouput  sequence  y  (  k  )  corresponding  to  (  49  )  and  (  50  )  is: 


y(k) 4  o'i 

I  7.1 


1.020  1.491  4.184  2.025  2.488  1.413  2.096  3.326 

572  4.357  4.382  2.408  0.957  3.313  0.769  1.873 

000  10.722  14.675  4.768  5.197  5.556  2.122  12.472 


-0.105  0.736  -0.601 
1.309  -0.120  0.613 

3.012  2.335  -0.965 


3.235  1.464  2.026  3.782  3.916  3.197 

2.084  2.209  3.769  4.349  4.277  3.543 

13.112  10.489  12.986  17.277  18.285  15.977 


According  to  [3],  it  follows  that  in  the  case  of  n  =:  5  and  p  =  3  , 
the  total  number  of  possible  seta  of  psuedo  observable  indices  is 
equal  to 

I  =  ,0  =  6.  These  6  sets  of  indices  are  {  1,1,3  }  , 

(3-1):  (5  -  3)! 

{  1,2,2}  ,  {  2,1,2  }  ,  {  2,2,1  }  ,  {  1,3,1  }  and  {  3,1,1  }.  In  case  of 
(  50  )  ,  it  can  be  concluded  that  the  sets  {  3,1,1  )  and  {  1,3,1  ) 
arc  non-admissible  and  that  the  set  {  2,2,1  }  corresponds  to  the 
unique  set  of  pseudo  observability  indices  [3],[4]  and  [6]. 

Using  the  available  input/output  sequence  and  applying  the  above 
mentioned  algorithm  for  all  the  admissible  sets  of  pseudo 
observable  indices,  we  get  : 

Ca.se  I  i  a*  =  {  1.1|3  }•  ^“"8  t***  selector  vectors  algorithm 

gives  ; 

,l={l,w},4  =  {3.<}.b‘.{u.W,9}4 
ll={  4,5,12}. 

After  building  the  identification  identity  (  15  )  &  (  18  )  ,  the 
following  parameter  matrix  Q  is  obtained  as  in  (  38  ),  where  B' 


II 

< 

.098 

.000  .003 

-.015 

.017 

.002 

.299  -.005 

.036 

-.051 

-.012 

.002  .041 

-.383 

1.103 

r.014 

1.061 

.017 

-.084 

.000 

.000 

-.310 

.908  1.052 

.254 

.000 

.000 

.285 

.808  - 

1.028 

-4.015 

1.030 

5.00Q 

Thus  the  system  matrix  A©  in  the  pseudo  observable  canonical 
form  is; 


T098 

.000 

.003 

-.015 

.017 

.002 

.299 

-.005 

.036 

-.051 

.000 

.000 

.000 

1.000 

.000 

.000 

.000 

.000 

.000 

1.006 

-.012 

.002 

.041 

-.383 

1.103 

A©  — 


The  equivalent  input  matrix  B©  used  in  calculating  B©  is  : 


B©  = 


The  initial  condition  vector  is  calculated  by  using  step  (  17  )  of 
the  algorithm; 

■■  1.020 


"l.O 

0.0 

0.0 

1.000 

1.010  ■ 

0.0 

1.0 

0.0 

0.020 

1.010 

0.0 

0.0 

0.0 

leading  to  B©  = 

1.030 

5.000 

0.0 

0.0 

0.0 

0.108 

1.500 

0.0 

0.0 

1.0 

0.012 

0.550 

X©  = 


1.020 

7.000 

2.100 

0.750 


The  same  procedure  is  followed  for  the  other  three  sets  of 
admissible  pseudo  ohservablitiy  indices  and  following  results  are 
obtained: 


•1  Csss  11 ;  0^  =  { 

1,2.2  },  8^=  {  1,4,5  }  ,  8?  =  {  2,3  } 

/  h2=|  1,2,3,5,6  ' 

[  &  1^=  {  4,8.9  }. 

B‘T= 


An— 


.099  .090  .002  -.331  -.003 
.000  -.120  .000  .700  .000 
.041  5.881 -.101 -19.7  .701 


B.= 


- 

-1 

7912 

1.361 

.331 

.000 

.000 

.1 

.112 

-.403 

-.680 

1.010 

I.OOO 

.1 

6.102  17.857 

20.698 

5.000 

.000 

.( 

.099 

.099 

.002 

-.331 

-.003 

.000 

.000 

.000 

1.000 

.000 

.000 

.000 

.000 

.000 

1.000 

.000 

-.120 

.000 

.700 

.000 

.041 

5.881 

-.101 

-19.7 

.701 

1.0  0.0 

0.0* 

‘  1.000 

1.010 

- 

0.0  0.0 

0.0 

0.020 

1.010 

0.0  0.0 

0.0 

B©  = 

1.030 

5.000 

0.0  1.0 

0.0 

0.006 

0.304 

0.0  0.0 

1.0 

0.108 

1.500 

Xo  = 


1.020 

1.020 

7.000 

0.308 

2.100 


CaasIUl  n*  =  |2,l,2}. 

S®=  {  2,4,5  }  ,  s?  =  {  1,3  },  li’={  1,2,3,4,6  }  fc  I^=  {  5.7,9  }. 


.298  .298  .005  -3.02 -.010 
A'^=  -.020  .000  .000  .300  .000 
-5.82  .020  -.201  -59.4  .902 


b'T= 

2.753 

4.112 

1.000 

.000 

.000 

.000 

-.200 

-.201 

1.000 

1.010 

.000 

.000 

-60.3 

-63.0 

1.030 

5.010 

.000 

.000 

Be= 


Tooo 

.000 

.000 

1.000 

.000 

.298 

.298 

.005 

-3.02 

-.010 

.000 

.000 

.000 

.000 

1.000 

-.020 

.000 

.000 

.300 

.000 

-5.82 

.020 

-.201 

-59.4 

.902 

0.0  0.0 

o.o' 

■  1.000 

1.010 

1.0  0.0 

0.0 

0.020 

l.Otli 

0.0  0.0 

0.0 

II 

c§ 

1.030 

5.000 

0.0  1.0 

0.0 

0.100 

0.102 

0.0  0.0 

1.0 

0.108 

1.500 

Xo  = 


1.020 

1.020 

7.000 

0.104 

2.100 


Cassliil  !}'*={  2,2.1  }. 

{  3,4,5  }  ,  s|  =  {  1.2  }.  h^={  1.2,3,4,5  }  <£  1“*=  {  8.7.8  }. 


A'T= 


B''r= 


29.2  29.2  .5  -296.0  -98 
-.020  .000  .000  .300  .000 
.000  -.120  .000  .000  .700 


269.8 

402.9 

97.9 

.000 

.000 

.000 

-.200 

-.201 

1.000 

1.010 

.000 

.000 

.112 

-.403 

-.680 

1.010 

1.000 

.000 

Tooo 

.000 

.000 

1.000 

.000 

.000 

.000 

.000 

.000 

1.000 

29.2 

29.2 

.500 

-296 

-98 

-.020 

.000 

.000 

.300 

.000 

.000 

-.120 

.000 

.000 

.700 

0.0 

0.0  o.o" 

■  1.000 

1.010 

- 

0.0 

0.0  0.0 

0.020 

1.010 

1.0 

0.0  0.0 

B,= 

1.029 

5.000 

0.0 

1.0  0.0 

0.100 

0.102 

0.0 

0.0  1.0 

0.006 

0.304 

Xo  = 


1.020 

1.020 

7.000 

0.104 

0.304 


{  2  ]  .  BinguUc,  S.  ,  and  Krtolica  ,  R  .  V.  (  1988  )  .  An 

Algorithm  for  Simultaneous  order  and  paramter  identification  in 
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Conclusion 

The  main  purpose  of  this  paper  is  to  show  that  in  identifying 
MIMO  systems,  the  number  of  possible  state  apace  representations 
corresponds  to  the  number  of  admissible  sets  of  pseudo  observable 
indices  and  that  one  of  these  sets,  not  necessarily  equal  to  the 
unique  set  of  pseudo  observable  indices  ,  leads  to  the  most 
coiivienient  representation  involving  manipulation  of  well 
conditioned  matrices.  In  this  case  the  matrices  Ao  and  Bq  have 
elements  with  relatively  small  absolute  values,  which  can  be 
verified  by  comparing  the  Case  I  with  the  other  three  cases  in  the 
above  illustrative  example. 
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Abstract : 

This  paper  presents  a  simple  computational  procedure  for  stabilizing  lightly  damped  systems  by 
infinitisimal  shifting  of  th  real  parts  of  the  eigenvalues  of  the  system  using  a  proportional  output 
feedback  regulator  matrix.  The  shifting  of  the  eigenvalues  is  achieved  by  continous  increments  in  the 
real  parts  of  the  system  modes.  The  output  feedback  regulator  matrix  performing  the  shifting  is 
obtained  by  solving  the  Lypanov  Algebnuc  Matrix  equation.  This  approach  facilitates  stabilizing  any 
particular  or  all  modes  of  the  system  by  shifting  the  corresponding  eigenvalues  as  far  as  desired  from 
the  imaginary  axes. 


Consider  a  state  space  representation  of  a  linear  time-invariant  multivariable  dynamic  system. 

x  =  Ax-|-Bu;y=Cx-fDu  (1) 

where  x  €  R°  ;  u  6  R™  ;  y  6  R^  are  the  state,  input  and  output  vectors  respectively,  while  A,  B,  C 
and  D  are  the  matrices  of  compatible  dimensions  with  B  and  C  of  maximum  rank  ,  i.e 

pI  B|  =  m  and  p|  C  |  =  p  ,  respectively.  (  2  ) 

Without  loss  of  generality  the  following  assumptions  are  made  : 

[l].m  =  p  =  ^.  (3) 

[  2  ]  .  All  eigenvalues  Aj  ,  i  =  1,  .  .  .  ,  n,  of  the  matirx  A  are  distinct  and  have  negative  real  parts. 

[  3  ]  .  All  eigenvalues  Aj  appear  in  complex-conjugate  pairs,  i.e  Aj_j^j  =  Aj  ,  i  =  1,  3,  ...  ,  n  -  1  ,  where 
Aj  =  -  o-j  -t-  j  wj  and  Aj^j  =  -  or.  -  j  wj  . 

[  4  ]  .  The  complex-conjugate  pair  {  A-  ,  A.  , ,  }  ,  will  be  referred  to  as  the  j*^  system  mode  m.  , 

.  .  n-hU  ‘  .  .  ...  J 


where  j  = 


Thus,  according  to  (  3  )  ,  in  A  there  are  p  oscillatory  modes. 


mj  =  -ffj  ±  j  wj  ;  j  =  1  ,  ...  ,  p  . 

[  5  ]  .  The  system  modes  are  arranged  according  to  their  magnitudes  i.e  : 
1  mj  I  >  I  m2  I  >  .  >  I  mp  I  ,  where  |  mj  |  =  ^  -h 


With  a  sequence  of  similarity  transformations  (  appendix  I  )  ,  a  given  system  representation 
{  A,  B  ,  C  ,  D  }  can  be  transformed  into  the  representation  {  A  ,  B  ,  C,  D  }  where  ; 


A  =  T  A  T 


1 


B  =  T  B 


The  matrix  A  in  (  4  )  has  the  following  structure  : 

A  = 


•,0  =  61^1  and 
0  :  I, 


D  =  D. 


(4) 

(  5  ) 


jj"  ”DoJ 

where  O  and  Im  are  (  m  x  m  )  zero  and  identity  matrices,  respectively  ,  while  TIq  and  D©  are 
diagonal  (  m  x  n  )  matrices  ,  i.e  :  fig  =  diag  |  |  and  D©  =  diag  {  2  <r j  |  (  6  ) 


Infinitisimal  Shifting  qf  Real  Parts  qf  Eigenvalues  ^ 

Using  an  (  m  X  m  )  proportional  output  feedback  regulator  matrix  K  ,  i.e 
the  closed  loop  system  matrix  Ac  becomes  :  Ac  =  A  -  B  K  C 

Partitioning  B  and  C  into  (  p  x  p  )  blocks  Bj^  ,  B2  ,  and  C2  we  get  ; 


B  = 


and 


u  =  -  Ky 


(7) 

(8) 

(9) 


The  closed  loop  matrix  Ac  may  be  written  as  : 


Ac  =  A 


BjKCj 

B1KC2 

— 

-BjKCj 

B2KC1 

B2KC2 

“Qq"® 

-Do'B2KC2 

(  10) 


The  output  feedback  regulator  matrix  K  surely  affects  all  mj  system  modes  ,  i.e  all  the  system 

eigenvalues  Aj.  The  results  of  the  above  calculations  are  given  in  Appendix  2  . 

♦ 

Consider  the  matrix  Ac  given  by  the  following  properties  : 


Ac  —  A  + 


■Di+  El 


E, 


E2-E1 


(11) 


where  the  matrices  Dj,  62  and  Ej  ,  i  =  1,2,3  ,  have  the  following  structures: 


Dj  =  diag|  26lj  }  ,  £>2  =  diag  |  26^.'^  ;  |  6*j  |  <  1  and  1  Ej  |  =  fj  ;  Cj  <  1.  (  12  ) 

Then  ,  the  modes  mj®  ,  i.e  the  eigenvalues  A^j  ,  of  the  matrix  Ac  are  such  that 


The  quantities  could  be  interpreted  as  increments  in  the  real  parts  of  the  modes  m?.  Comparing  (  10 
J  J 

)  and  (  11  )  ,  it  can  be  concluded  that  in  order  to  shift  the  modes  mj  to  the  position  m^'j  ,  the  matrix 
K  should  satisfy  the  following  equations: 

BjK  Cj  +  B2K  C2  —  ^2  +  ^2 


or 

K  CjC  €2)'^  +  (  B2)-^B2  K  =  (  Bj  )*^  D  (  C2  )'^  ;  where  D  =  Dj  +  D2  (  14  ) 

Note  that  the  equation  (  14  )  is  the  Lypanov  Algebraic  Matrix  Equation,  which  could  be  solved  for  the 
unknown  matrix  K  ,  provided  that  ; 

(  a  )  .  The  matrices  B2  and  C2  are  non-singular  and, 

(  b  )  .  All  the  eigenvalues  A^j  and  A^j  of  the  matrices  (  Bj)'^  B2  and  C2  (  C2  )'^  satisfy  the 
conditions  A^j  4-  ^  0,  V  i  and  j  . 

In  case  of  (  2  )  ,  conditions  (  a  )  and  (  b  )  are  generically  satisfied. 

Defining  D  =  diag  |  2Aj  |,  Aj  <  1,  the  equation  (  14  )  could  be  solved  for  the  unknown  K.  From 
the  equations  (  10  )  and  (  11  )  it  could  be  concluded  that  the  real  parts  of  of  the  eigenvalues  of  the 
closed  loop  matrix  Ac  ,  (  8  )  ,  will  satisfy  the  relation  : 

stabilizing  Lightly  Damped  Systems. 


Assume  that  the  system  (  1  )  is  lightly  damped  ,  i.e  its  eigenvalues  are  relatively  close  to  the 
imaginary  axis.  Also  assume  ,  that  it  is  desired  to  move  some  of  these  eigenvalues  farther  left  in  the  s- 
complex  plane.  This  could  be  achieved  by  a  sequence  of  infinitisimal  shifting  of  the  real  parts  as 
explained  in  the  previous  section. 

The  following  algorithm  is  proposed: 

[  1  ]  .  Representation  {  A,  B,  C,  D  }  of  a  lightly  damped  system  is  given  . 

Set  the  iteration  counter  I;  =  1. 

The  representation  will  be  denoted  as  {  A^,  Cp  }. 

Define  a  (  p  x  p  )  zero  matrix  ,  Kp 

[  2  ]  .  By  the  sequence  of  similarity  transformations  (  Appendix  1  )  ,  the  representation 
{  Aj^  ,  B^,  Cj|.,  Dj|.  }  is  transformed  into  {  Aj^,  Dj^  }  ,  (  4  )  , 

where  Aj^  satisfies  the  property  (  5  )  . 

[  3  ]  .  Define  a  diagonal  matrix  D^^  =  diag  |  2Ajj^  }  »  0  <  1  •  Non-zero  Aj  indicate 

which  modes  are  desired  to  be  infinitisimally  shifted  to  the  left  in  the  iteration. 


[  4  ]  .  Solve  the  Lypanov  equation,  i.e 

Ki  (  Ojj  )-l  +  (  )■•  Bjj  Kj  =  (  B,,)-!  Dj  ( 

Set  Kj  Kjt  =  Kp 

[  5  ]  .  Build  the  closed  loop  system  representation  {  A^j^,  Bj^,  D^^  },  where 

=  Ajt  -  Bit  K* 

Modes  of  the  closed  loop  system  in  the  it*^  iteration  are  related  to  the 
modes  mj^  of  (  16  )  by  the  relation  :  ~  ™jjt  ^jit  ‘ 

[  6  ]  .  Increment  the  iteration  counter  by  1,  i.e  set  t  ~  it  -i-  1  and  set 

"^it  =  \{k-l)  ’  ®it  =  B(it-l)  ’  =  ^(it-1)  ’  ^k  =  ^(t-1)- 


(  17) 

(  18) 


[  7  ]  .  If  the  modes  of  the  representation  {  A,  B,  C,  D  }  are  sufficiently  shifted  stop,  ELSE 
Go  to  step  [2]  . 


**  Appendix  1  and  2  ,  an  Illustrative  example,  the  Flowchart  of  the  algorithm  suggested  and 
references  will  be  included  in  the  final  version  of  the  paper. 
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Abstract 

The  Gaussian  sum  estimation  method  developed  by  Sorenson  and  Alspach  [1]  is 
investigated.  An  alternate  development  of  the  Gaussian  stim  density  approximation 
is  given  and  a  modified  adaptive  estimation  structure  is  proposed  based  on  the 
adaptive  Kalman  filter  scheme  first  presented  by  Magill  [2],  and  extended  by  Moose 
and  Wang  [3].  A  necessary  condition  for  effective  estimation  is  examined  [5].  Several 
alternate  configurations  are  proposed  when  this  condition  is  not  met.  Simulation 
results  illustrating  the  theory  are  analyzed.  A  suboptimal  method  of  tuning  the 
parameters  of  the  adaptive  structure  is  suggested. 


Introduction 

An  estimation  technique,  applicable  to  both  linear  systems  with  nonGaussian 
inputs  and  nonlinear  systems  with  Gaussicin  inputs,  has  been  developed  by  Sorenson 
and  Alspach  [1].  In  the  case  of  linear  systems,  the  noise  processes  associated  with  the 
plant  and  measurement  are  assumed  known  and  nonGaussian.  The  probability 
density  function  of  each  noise  process  is  approximated  by  a  Gaussian  sura;  that  is,  a 
weighted  sum  of  Gaussian  density  functions.  The  Gaussian  sum  approximation  is 
written  as 

PGs(a:)=  (1) 

where 


and 


M 

^0^  =  1;  Oj  >  0  for  i  =  1,2,. . M 

«=i 


(2) 

(3) 
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For  sufficiently  large  M,  any  density  function  can  be  closely  approximated  by  a 
Gaussian  sum.  As  long  as  condition  (2)  holds,  the  Gaussian  sum  is  always  a  valid 
density  function. 


Alternate  Gaussian  Sum  Development 

In  developing  their  estimator,  Sorenson  and  Alspach  begin  by  directly 
representing  the  plant  and  measurement  noise  processes  by  Gaussian  sums.  The 
conditional  density  p{xif  |  of  the  current  state  vector  given  the  current 
measurement  sequence  =  {zi,Z2,...,Zk}  is  then  updated  using  Bayes’  rule. 

A  major  problem  with  their  development  is  that  the  number  of  terms  in  the 
Gaussian  sum  increases  at  each  iteration  of  their  estimator.  This  problem  of  growing 
memory  can  be  avoided  by  the  following  alternate  development  of  the  Gaussian  sum 
density  approximation.  The  nonGaussian  noise  process  or  input  signal  is  modeled  as 
the  sum  of  two  statistically  independent  random  processes 

u  =  6  +  n  (4) 

The  first  term,  6,  is  a  semi-Markov  process  with  state  transitions  governed  by  the 
transition  probability  matrix  of  a  conventional  Markov  process.  However,  the 
amount  of  time  spent  in  state  i  before  the  next  transition  to  state  j  is  a  random 
variable  [4].  Its  values  lie  within  a  fixed  set  of  discrete  values,  characterized  by  a 
delta  probability  density  function 

M 

p(b)='£PM->'i)  (5) 

with 

M 

£P,  =  1;  P,>0  fori  =  l,2,...,M  (6) 

»=i 

This  process  can  be  thought  of  as  a  randomly-switching  bias,  each  bias  value  6, 
having  probability  P,-.  The  second  term,  n,  is  a  zero  mean  Gaussian  process  with 
variance  a].  The  variance  switches  at  the  same  time  the  bias  switches.  This 
switching  variance  can  be  thought  of  as  chamging  the  power  of  the  Gaussian  process. 

With  both  densities  known,  and  using  the  convolution  relationship  between  u,  n, 
and  6,  the  density  function  of  u  is  given  by 

M 

(7) 

1=1 

Thus,  the  nonGaussian  density  function  of  u  can  be  modeled  as  a  Gaussian  sum.  The 


weight  of  each  Gaussian  term  is  the  probability  of  the  biasing  term.  By  allowing 
only  a  fixed  number  of  bias  values,  the  number  of  terms  in  the  Gaussian  sum  at  each 
iteration  of  the  estimator  is  fixed,  thereby  eliminating  the  growing  memory  problem. 

Selecting  the  parameters  P,-,  6^,  and  in  (7)  to  obtain  the  “best”  approximation 
Pqs  to  some  actual  nonGaussian  density  function  p  is  accomplished  by  means  .of 
minimizing  the  norm 


HP 


fOo  M 

-  Vgs  h''  =  [  P(«)  -  £  -Pi  N[6i,  a?] 

•>  -  oo  ,=1 


du 


(8) 


This  curve  fitting  exercise  can  be  done  off-line  using  several  values  of  M  until  a 
suitable  trade-off  between  minimum  norm  and  minimum  M  is  obtained. 


Development  of  a  Modified  Gaussian  Sum  Adaptive  Filter 
A  modified  Gaussian  sum  adaptive  filter  is  now  developed  for  a  general  linear 
system  with  a  nonGaussian  input  signal  and  Gaussian  plant  and  measurement  noise 
processes.  The  system  is  modeled  in  standard  discrete-time  state-space  form  as 


where 


Xk+i  =  ^Xk  +  Tui,  +  ^iv^  (9) 

Zk  =  Hxk  +  Duk  +  Vk  (10) 

=  h  +  (11) 


X 

z 

w 


V 


^,r,  <5,77,0 
u 


is  the  state  vector 

is  the  measurement  vector 

is  a  zero  mean  white  Gaussian  plant  noise  process 
with  covariance  Q 

is  a  zero  mean  white  Gaussian  measurement  noise 
process  with  covariance  i?,  independent  of  w 
are  the  respective  constant  transition  matrices 
is  the  vector  Gaussian  sum  signal  model  (4)  of 
the  actual  “’onGaussian  input  signal,  comprised  of 
semi-Markov  bias  vector  and  zero  mean  white 
Gaussian  noise  with  covariance 


The  optimal  estimate  of  the  state  vector  is  found  from  the  conditional  mean  as 

,oo 

h+i  =  ^[^k+i\^k+A=  \  Xk+\P{^k+\\Zk+i)  dxk+i  (12) 

J  —  oo 

where  is  the  measurement  sequence  {z^,  Z2»  •••>  Using  Bayes'  theorem, 

the  conditional  density  function  of  (12)  can  be  written  as 


(13) 


The  bias  vector  6jt  and  covariance  matrix  5^  axe  explicitly  brought  into  (13)  by 
considering  the  joint  density  p(a:fc+i,2fc+i)  to  be  a  marginal  density  found  from 
P(®fc+u^fc+ii^fc»‘S'fc)  by  summing  over  the  6^  and  terms 

M 

p(®fc+l>  ^fc+li  ^i»  “^i) 

P(®fc+1  I  ^fc+l)  = 

After  some  additional  algebraic  manipulation,  the  optimal  estimate  of  (12)  becomes 

^fc+i  =  S^fc+i  viKSi  I  Zfc+i)  (15) 

i=l 

where  is  the  conditional  mean  estimate  of  given  that  6^  =  ^^d  5^  =  5,-, 

denoted  by 

x'k+i  =  ®fc+i  p(®ik+i  1  Zk+iA^Si)  dXfc+1  (16) 

J  —  oo 

and  the  weighting  factor  p(fc„5,- ]  Zjt^i)  is  the  probability  that  fe/t  =  *>^d  5^  =  5,- 

given  the  current  measurement  sequence.  The  overall  estimate  of  (15)  is  then  a 
weighted  sum  of  individual  estimates,  each  based  on  a  paxticulcir  set  of  pairameters  b, 
and  5j. 


Each  estimate  (16) 

is  found  by  a  modified  Kalman  filter. 

Inserting  (11)  into  (9) 

and  (10)  produces 

^fc+i  =  +  r&fe  +  ly'fc 

(17) 

=  Hxfc  +  Dhf^  +  v'k 

(18) 

where 

w'lf  =  Fn^  + 

(19) 

v'k  =  Duk  +  Wfc 

(20) 

Letting  Ujj,,  and  be  statistically  independent  zero  mean  white  Gaussian 
processes,  and  recognizing  that  =  0,  the  Kalman  filter  equation  for  (15)  is 


4+.  =  + rti  +  /fu,|r»+,  -  Db,  -  +  r6j)]  (21) 

with  covariance  and  gain  equations 

Ml+i  =  +  r5.r  +  (22) 

KU,  =  MU,H^(HMUxir  +  DSfi-'  +  R)-'  (23) 

PUi=U-KUi«)MUi  (24) 


The  structvire  of  the  overall  adaptive  filter  is  a  bank  of  Kalman  filters  operating 
in  parallel,  with  each  individual  estimate  multiplied  by  its  own  weighting  term 
probability.  This  is  the  basic  structure  as  outlined  in  Magill  [2],  with  some 
modifications  as  will  be  shown.  The  filter  based  on  the  parameter  set  that  most 
closely  matches  the  actual  parameters  of  the  modeled  system  will  have  a  weighting 
term  that  tends  closer  to  one,  while  the  weights  of  the  mismatched  filters  will  tend 
towards  zero. 

Using  Bayes’  rule  and  writing  as  the  weighting  term  becomes 

_/l  C  I  <7  ^  _  p('^fc+l  I  pi^iy^i  I  ^k)  fnr\ 

- -  (25) 

The  denominator  term  is  independent  of  i  and  is  a  scale  factor  that  ensures  that  the 
sum  of  the  weights  (25)  at  each  iteration  is  equal  to  one.  The  second  term  of  the 
numerator  can  be  rewritten  to  explicitly  include  the  sample  time  for  the  bias  and 
covariance  terms 

Pi^iy^i  I  ^k)  —  Pi^k+1  —  ^iy^k+l  —  ^i[^k)  (26) 

Using  Bayes’  rule  and  the  definition  of  marginal  densities,  (26)  can  be  written  as 
M 

p(.b„  S,  1 2  J  =  ii,  =  Si  1 =  5^)  p(5,  =  tj,  S,  =  S,  I  Z»)  (27) 

j=i 

The  following  notation  will  be  used  to  express  the  two  terms  of  the  summation  (27) 

©i.i  =  P{h+i  =  KSk+i  =  Si  1 6fc  =  bjySk  =  Sj)  (28) 

wi  =  p{bk  =  bj,Sk  =  Sj\Z,^)  (29) 

The  density  function  of  (28)  is  a  Maxkov  tr^Lnsitional  probability  matrix  [4];  that  is, 
0j  j  is  the  conditional  probability  that  b  —  bi  and  S  =  S,  at  time  A:  4-1,  given  that 
b  =  bj  and  5  =  Sj  at  time  k.  The  density  function  of  (29)  is  of  the  same  form  as  (25) 
and  is  just  the  previous  weighting  term  at  the  previous  value  of  time  k. 


Necessary  Condition  for  Effective  Estimation 
The  first  term  of  the  numerator  of  (25)  is  of  prime  importance  in  determining 
whether  or  not  the  adaptive  filter  will  work  properly  for  a  given  system.  It  can  be 
modeled  as  a  Gaussian  density  if  the  bias  and  covariance  terms  switch  slowly 
compared  to  the  sample  interval  k.  This  assumption  will  be  made  here  and  has  been 
verified  by  extensive  simulation  and  analysis  by  Moose  and  Wang  [3].  This 
conditional  density  is 


p(zt„  I  =  Nlff(ixi  +  ry  +  Dii,  +  DS,D^  +  ii|  (30) 


Th  measurement  residual  from  (21)  is 

^1+1  =  ^k+i  -  -  H{^^k  +  rJ.)  (31) 

The  mean  and  covariance  of  (30)  axe  readily  available  from  the  residual  (31)  and  the 
Kalman  gain  (23).  Essentially,  the  measurement  residual  of  the  matched  filter  will 
have  the  smallest  expected  value  (typically,  zero  mean),  while  the  residuals  of  all  the 
mismatched  filters  will  be  biased.  Under  the  Gaussian  assumption,  the  probability  of 
the  matched  filter  will  be  the  largest  among  all  the  filters. 

For  some  -"vstems,  however,  the  statistical  steady-state  value  (that  is,  the  steady- 
state  value  oi  expected  value)  of  the  measurem  residual  of  each  filter  goes  to 
zero.  None  oi  ;  residuals  are  biased.  T  -refore,  j  adaptive  system  cannot  detect 
which  filter  ha  he  correct  parameter  set  and  the  weights  all  tend  to  the  same  value. 

The  statistical  steady-state  (SSS)  value  of  the  measurement  residual  is  a  function 
of  the  dc  gain  of  the  system  transfer  function  [5].  If  the  dc  gain  of  the  system  is  zero, 
then  the  SSS  value  of  the  residual  is  zero.  This  condition  provides  a  simple  test  as  to 
whether  or  not  this  adaptive  system  will  work  properly  for  a  particular  system. 

Alternate  Estimation  Procedures 

A  possible  solution  to  this  problem  is  to  first  determine  if  an  alternate 
measurement  provides  a  nonzero  dc  gain.  For  example,  if  in  the  zero  dc  gain  system 
acceleration  measurements  are  talcen,  changing  to  velocity  measurements  may 
provide  a  nonzero  ciin.  If  the  actual  sensorj-  producing  these  measurements 

cannot  be  change  cegrating  the  acceleration  measurement  data  to  produce 

approximate-veil.  tisurement  data  may  jJlow  the  adaptive  system  to  work. 

To  illustrate  this  point,  consider  a  general  modal  structure  containing  modes 
which  are  closely  spaced  in  frequency  and  heavily  damped.  The  input  to  the  system 
is  a  nonGaussian  stochastic  FM  signal,  not  directly  measurable  by  the  system.  The 
model  of  *his  system  follows  (9-11).  Checking  the  dc  gain  of  this  system  using  cin 
acceleration  meaisurement  (with  no  noise) 

G(u  =  =  -^T-\-D  (32) 

gives  us  zero  dc  gain.  Since  the  SSS  value  of  the  measurement  residual  is  a  function 
of  this  dc  gain 


r;.  =  |/  -  HUI  -  (/  +  )$)-‘iii-„|G„{u,  -  6.) 


(33) 


the  residuals  for  all  filters  go  to  zero  when  the  system  reaches  SSS.  Therefore,  the 
residual  density  function  |  bi,Si,Zic)  cannot  discern  which  filter  is  the  matched 

filter  and  all  the  weighting  terms  (25)  become  equal. 

Changing  this  modal  system  to  output  velocity  measurement  data  produces  a 
transfer  function  with  a  nonzero  dc  gain.  Typically,  changing  the  actual  system 
cannot  be  done  in  practice.  However,  given  a  large  signal  to  noise  ratio  between  the 
input  signal  and  measurement  noise,  a  good  approximation  of  the  velocity 
measurement  data  can  be  generated  by  integrating  the  acceleration  measurement 
data  from  the  original  system  using  a  first  or  second  order  discrete  time  integrator. 
Redesigning  the  residual  density  function  to  be  based  on  the  velocity  measurement 
model,  while  actually  using  approximate-velocity  measurement  data,  will  cause  the 
weighting  terms  to  properly  select  the  matched  filter. 

Simulation  Analysis  of  Alternate  Adaptive  Filters 

Several  simulations  were  run  to  test  the  performance  of  the  Gaussian  sum  filter 
under  various  conditions.  One  simulation  implements  the  velocity  measurement 
system  with  nonzero  dc  gain,  and  uses  an  adaptive  filter  design  based  on  the  velocity 
measurement  model.  This  serves  as  a  benchmaxk  for  comparison  with  the  other 
simulations. 

A  second  simulation  implements  the  acceleration  measurement  system  with  zero 
dc  gain.  Two  adaptive  filters  are  used.  One  filter  is  designed  based  on  the 
acceleration  measurement  model,  and  produces  the  individual  state  estimates  from 
(21).  The  second  filter  is  designed  based  on  a  velocity  meeisurement  model,  but  uses 
approximate-velocity  mecisurement  data  generated  firom  a  discrete  time  integrator. 
This  filter  is  used  to  produce  the  proper  weighting  terms  (25).  The  overall 
performance  is  not  as  good  as  the  filter  of  the  first  simulation.  The  second  filter 
could  also  be  used  to  generate  the  individuail  state  estimates,  but  the  performance  is 
no  better  than  the  first  filter  since  the  same  acceleration  measurement  data  is  used  to 
generate  the  approximate-velocity  measurement  data. 

A  third  simulation  implements  a  dual  measurement  system,  where  the 
measurement  is  comprised  of  two  models;  the  acceleration  measurement  model,  and 
the  velocity  measurement  model.  The  adaptive  filter  is  designed  based  on  the  dual 
measurement  model  and  uses  acceleration  measurement  data  plus  approximate- 
velocity  measurement  data.  This  filter  cam  be  tuned  to  perform  better  than  the  filter 
of  the  second  simulation,  but  not  as  well  as  the  filter  of  the  first  simulation. 

Determination  of  Appronmate  Parameter  Values 

As  mentioned  previously,  the  system  cannot  directly  measure  the  input  signal. 
Referring  to  the  stochastic  FM  input,  the  bias  parameters  6,  used  to  model  its  density 


as  a  Gatissian  stun  are  based  on  the  amplitude  of  the  signal.  For  good  performance, 
the  adaptive  filter  must  have  some  approximate  knowledge  of  this  amplitude. 

One  possible  method  of  determining  amplitude  information  [5]  is  to  monitor  and 
process  the  measurement  residual  of  the  zero-bias  filter  (the  individual  filter  with  a 
bias  parameter  value  of  zero)  of  the  weighting  term  adaptive  filter  of  the  second 
simulation  described  previously.  Setting  =  0  in  (31)  gives 

rh,  =  (34) 

The  velocity  measurement  model,  given  below,  of  the  weighting  term  adaptive  filter 
is  of  the  same  form  as  (10),  except  D  =  0 


^k+i  =  Hxt+i  -I-  Ufc+i  (35) 

Using  (9),  (35),  and  setting  the  error  e2  =  (xfc-x2),  the  zero-bias  residual  (34) 
becomes 

=  Hiel  +  HTuk  +  +  ut+i  (36) 

Given  that  the  input  signal  u*  is  much  larger  than  both  tn*.  and  Ufe+i,  and  then 
premultiplying  both  sides  of  (36)  by  the  zero-bias  residual  is  processed  as 


=  (^^r)-V2+i  -  {HTyWel  -h  Ufc  (37) 

If  the  expected  value  of  the  first  term  on  the  right  hand  side  of  (37)  is  close  to  zero, 
then  the  expected  value  of  the  processed  zero-bias  residual  can  be  approximated  by 

E[«*+i]  ^  «ik  (38) 


To  approximate  this  expected  vaJue,  a  smoothed  version  of  is  found  from  a  first 
order  discrete  time  smoother 


aUi=«.S;  +  (l-«.)2*  (39) 

The  peak  values  and  power  of  an  n-sample  window  of  (39)  are  calculated  and  used  to 
periodically  update  the  bias  parameters  in  the  remaining  nonzero-bias  filters  in  the 
adaptive  filter  bank.  This  could  serve  as  a  suboptimal  method  of  tuning  the  bias 
parameters  to  their  “optimal”  values  when  the  input  signal  amplitude  is  originally 
unknown,  or  if  the  signal  makes  large  changes  in  amplitude  at  unknown  times. 
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Abetract. 

A  computational  simpliGcation  of  the  Ho-Kalman  minimal  realization  procedure,  recently  referred  to  as 
the  Eigensystem  Realization  Algorithm  (ERA),  is  proposed.  According  to  the  simplified  algorithm, 
instead  of  performing  the  singular  value  decompontion  (SVD)  of  the  Haokel  matrix  and  multiplying 
with  matrices  contmning  left  and  right  singular  vectors,  a  minimal  system  state  space  representation 
may  be  obtained  by  simple  selecting  appropriate  rows  or  columns  from  the  Hankel  matrix.  The 
obtained  state  representation  is  in  either  Pseudo-Controllable  or  Pseudo-Observable  form. 

Introduction. 

In  their  seminal  papter  [1],  Bo  and  Kalman  suggested  a  procedure  for  calculating  a  minimal  state  space 
representation  R  =  of  a  MEMO  system  from  given  Markov  Parameters  K,  ,  defined  by 

Y-  =  CA'B  ;  i  =  0  1  ,  .  .  .  ,  N.  Since  publication  of  that  paper  a  number  of  papers  and  textbooks, 

[l]-[8],  have  referred  to  this  minimal  realization  procedure  and  have  suggested  slightly  modified 
versions.  Relatively  recently,  this  algorithm  has  been  given  considerable  attention  ,  particularly  in  the 
area  of  large  flexible  structures,  where  it  has  received  a  name  ERA  (Eigensystem  Realization 
Algorithm),  (9]-[12].  In  all  these  papers  different  versions  were  suggested  ,  but  all  of  them  basically 
follow  the  following  procedure. 

1:  Define  Markov  Parameters  ,  i.e.  (pxm)  matrices  K,  ,  of  an  n^”  order  MIMO  system  with  m  inputs 
and  p  outputs. 

2:  Build  the  (rp  x  rm)  Hankel  matrix  H,.  given  by,  ,  = 

3:  Increase  the  r  sequentially,  until  the  rank  of  does  not  increase,  i.e  when  p\H , 
n  =  p  \H^\  represents  the  order  of  a  minimal  state  space  representation 
4:  Perform  the  SVD  of  H 


Y  ..Y 


for  an  arbitrary  integer  r. 


cn 


H,  =  U 


-n" 


o 


0...0 


■ i.e.  calculate  matrices  U,  and  V  satisfying 


where  =  diag|<r,, . > 


0 


5:  The  tnatrices  X,  B,  C  \n  a  minimal  realization  are  given  by  A  =  {E„)'*^*  Uj  H ^  Vj  (E„)**^^, 


C  =  £/n  (SJ 


1/2 


where 


1' .X 


2r- 


and  l/jj  and  Vjj  are  (rpxn),  (rmxn), 

(n  X  m)  and  (p  x  n)  matrices  respectively. 


ffj  contains  the  first  n  columns  from  U  and  Kj  contains  the  first  n  columns  from  V.  Similarly,!/] | 
contains  the  first  p  rows  from  (7j  and  Vjj  contains  the  first  m  columns  from  (Vj)^. 

Simplified  ERA. 

The  computational  simplification  in  ERA  suggested  in  this  paper  is  based  on  the  use  of  pseudo 
controllability /observability  indices,  introduced  recently.  Consequently,  as  explvned  in  [13]-[16],  there 
are  more  equivalent  state  space  representations  in  pseudo  controllable/observable  forms  corresponding 
to  the  given  Markov  parameters,  K,-.  Thus,  after  calculating  the  rank  of  ^,.,(steps  1,2,3  of  the  above 
algorithm),  the  only  calculations  that  should  be  done,  in  the  case  of  representing  the  system  in  a 
pseudo  controllable  form  are  :  (i.)  Select  n  appropriate  columns  from  with  locations  vj  »»'  ; 

<  j  ,  into  the  (rpxn)  matrix  and  select  n  columns  from  with  locations  m  +  vf  ,..,  m  +  v^  , 
into  the  (rpxn)  matrix  Hj.  (ii.)  The  system  matrix  .4,.  in  a  pseudo  controllable  form  is  given  by  the 
least  square  solution  of 

-  ■  ”  (1) 


If  I A 


=  //,  or  .4,  =  (///  H.y'HjH. 

.  -i _  _ L..  a  _  If  :  /I  II 


The  input  matrix  B^  is  fixed  and  is  always  given  by  =  [/„,:  O  ]  ,  while  the  output  matrix  <_  ^ 

contains  the  first  p  rows  from  /fj.  Similarly,  in  order  to  obtain  a  state  space  representation  in  a 
pseudoobservable  form  ,  the  procedure  is  dual  to  the  procedure  for{Aj,Bj,CjJ.  Obviously,  the 
calculation  required  by  (1),  is  much  simpler  than  those  required  in  the  steps  4-5  of  the  original  ERA 
algorithm.  Particularly,  when  the  system  order  is  known  or  assumed  calculation  of  SVD  is  not 
required.  The  n  location  numbers  vj  ....  defining  the  columns  of  to  be  selected  in  matrices  //, 
and  //j  are  uniquely  related  to  the  assumed  set  of  admissible  pseudo  controllability  indices, 
respectively,  (15). 
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Consider  the  example  given  in  {17},  pp  491-492: 
Given  Markov  Parameters  Y-,  i  =  0,  .  .  .  ,  5  : 


Example 


in  Ii7l,  pp  r  ir  ir  ir»ir  If. 

V'-n  0111  12  14  27 

‘  . .  0  0  0  0  j  -1  1  0  1  11  12 


The  order  of  the  minimal  realization  and  the  number  of  inputs  and  outputs  are  n  =  4,  m  =  2  a 
p  =  2.  The  possible  sets  of  pseudo  controllable  indices  are  {1,3}, {2, 2}  and  {3,1}  .  The  Hankel  mat 
H  and  the  locations  {{/‘{^corresponding  to  the  sets  of  ■  oudo  controllability  indices  are  : 


i  I  .p.c.i:  {1,3}  {A2}  {3,1} 

Yj-.-ysl  locations  :{  1,  2,  4,  6  }  {  1,  2,  3,  4  }  {  1,  2,  3,  5  } 

The  matrices  ^  and  H2  formed  using  the  above  location  indices,  as  defined  in  the  earlier  section. 

It  is  verified  that  the  matrix  /f  j  in  the  case  of  p.c.i  {2,2}  is  not  of  full  rank,  therefore,  this  set  of  p.c.i 
is  not  admissible  .  According  to  (1),  the  pseudo  controllable  forms  corresponding  to  pseudo 

controllability  indices  {1,3}  and  {3,1}  are  : 


p.c.i  {1,3}  : 


p.c.i  {3,1}  : 


1001 
2  0  0  2 
•1100 
0011 


0101 
0  2  0  2 
I  -1  0  -1 
0  0  10 


-  -1111 
~  0  0  0  1 


-110  1 
0  0  0  -1 
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